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SEQUENCE LISTING 

<110> Schupp, Thomas 
Ligon, James 
Molnar, Istvan 
Zirkle, Ross 
Cyr , Devon 
Goerlach, Joern 

<120> GENES FOR THE BIOSYNTHESIS OF EPOTHILONES 

<130> 4-30582A 

<140> 
<141> 

<160> 30 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 68750 
<212> DNA 

<213> Sorangium cellulosum 
<400> 1 

aagcttcgct cgacgccctc ttcgcccgcg ccacctctgc ccgtgtgctc gatgatggcc 60 
acggccgggc cacggagcgg catgtgctcg ccgaggcgcg cgggatc£ag gacctccgcg 120 
ccctccgaga gcacctccgc atccaggaag gggggccgtc ctttcactgc atgtgcctcg 180 
gcgacctgac ggtggagctc ctcgcgcacg accagcccct cgcgtccatc agcttccacc 240 
atgcccgcag cctgaggcac cccgactgga cctcggacgc gatgctcgtc gacggccccg 300 
cgctcgtccg gtggctcgcc gcgcgcggcg cgccgggtcc cctccgcgag tacgaagagg 360 
agcgcgagcg agcccgaacc gcgcaggagg cgaggcgcct gtggctcgcg gccgcgccgc 420 
cctgcttcgc gcccgatctg ccccgcttcg aggacgacgc caacgggctg ccgctcggcc 480 
cgatgtcgcc tgaagtcgcc gaggccgagc ggcgcctccg cgcctcgtac gcgactcctg 540 
agctcgcctg tgccgcgctg ctcgcctggc tcgggacggg cgcgggtccc tggtccggat 600 
atcccgccta cgagatgctg ccagagaatc tgctcctcgg gtttggcctc ccgaccgcga 660 
tcgccgcggc ctccgcgccc ggcacatcgg aggccgctct ccgcggcgca gcgcggctgt 720 
tcgcctcctg ggaggtcgta tcgagcaaga agagccagct cggcaacatc cccgaagccc 780 
tgtgggagcg gctccggacg atcgtccgcg cgatgggcaa tgccgacaac ctctctcgct 840 
tcgagcgcgc cgaggcgatc gcggcggagg tgcgccgcct gcgcgcacag ccggcgccct 900 
tcgcggcggg cgccggcctg gcggtcgctg gggtctcctc gagcggccgg ctctcgggcc 960 
tcgtgaccga cggagacgca ttgtactccg gcgacggcaa cgacatcgtc atgttccaac 1020 
ccggccggat ctcgccggtc gtgctgctcg ccggaaccga tcccttcttc gagctcgcac 1080 
cgcccctcag ccagatgctc ttcgtcgcgc acgccaacgc gggcaccatc tccaaggtcc 1140 
tgacggaagg cagccccctc atcgtgatgg caagaaacca ggcgcgaccg atgagcctcg 1200 
tccacgctcg cgggttcatg gcgtgggtca accaggccat ggtgcccgac cccgagcggg 1260 
gcgcgccctt cgtcgtccag cgctcgacca tcatggaatt cgagcacccc acgcctcgtt 1320 
gtctccacga gcccgccggc agcgctttct ccctcgcctg cgacgaggag cacctctact 1380 
ggtgcgagct ttcggctggc cggctcgagc tatggcgcca cccgcaccac cgccccggcg 1440 
ccccgagccg cttcgcgtac ctcggcgagc accccattgc ggcgacctgg tacccctcgc 1500 
tcaccctcaa tgcgacccac gtgctgtggg ccgaccctga tcgcagggcc atcctcgggg 1560 
tcgacaagcg caccggcgta gagcccatcg tcctcgcgga gacgcgccat cccccggcgc 1620 
acgtcgtgtc cgaggaccgg gacatcttcg cgcttaccgg acagcccgac tcccgcgact 1680 
ggcacgtcga gcacatccgc tccggcgcct ccaccgtcgt ggccgactac cagcgccagc 1740 
tatgggaccg ccctgacatg gtgctcaatc ggcgcggcct cttcttcacg accaacgacc 1800 
gcatcctgac gctcgcccgc agctgacatc gctcgacgcc gggccgctca tcgagggcgc 1860 
ccggaccgag ctggcgaccc gccgctggcg ggccgcagct catgccgatt cggtggcgac 1920 
gtagacgctg cgccagaaac gctcgagagc ccccgagaac aggaagccgg cggattgtgt 1980 
catcacgatc ccgatcagct cgcggcccgg atcattgatc caggacgtcc cgaacccgcc 2040 
gtcccaccca tagcgcccgg gcacctccga gaccgcgtcc ggcgccgtga ccacggccat 2100 
cccataaccc cagccgtgcg tctcgaagaa gcccgggaaa aacgaggacg ccgccttctg 2160 
ggccggcgtg aggtgatcgg ccgtcatctc gcgcaccgag gcggcgctca agagccgccg 2220 
gccctcgtgc acaccgccgt tcatgagcat gcgcgcgaac aggaggtagt cgtccaccgt 2280 
cgacacgagc ccggcggcgc ccgaagggaa cgccggcggg ctggcatagg cgctctcggc 2340 
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cccgtcgcga tccatgcgcg tcttctcccc cgtctgctcg tcggtgaagt aaccgcagcc 2400 
cgcgaaccga gcgagcttgt ccgccgggac gtgaaagtcg gtgtcccgca tcccgagcgg 2460 
cgcgaggatg cgctcgcgca cgaacgcatc gaagccctgg tcggccgcgc gccccacgag 2520 
caccccctgc accaggctcc ccgtgttgta catccactgc gcccccggct gatgcatgag 2580 
cggcagcgtc ccgagccgcc ggatccactc gtctggcccg tgcggcgtca tcggcaccgg 2640 
ctgcgcgttg acgagcccga gctcgtcgat ggcccgctgg atcggcgacg atgcgtcgaa '2700 
cgagattccg aagcccatcg tgaacgtcat caggtcgcgc accgtgatcg gccgctccgc 2760 
gggcaccgtc tcgtcgatcg gaccatcgat gcgcgccagc accttccggt tcgcgagctc 2820 
cggcaaccat cggtcgacgg gggagtcgag gtcgagcttg ccttcctcga cgagcatcat 2880 
caccgccgtc gcggtgaccg ccttcgtcat cgaggcgatc cggaagatcg tgtcccgccg 2940 
catgggcgcg ctgccgccga gctcggtcac gcccaccgcg tccacgtgca cgtcgtcgcc 3000 
gcgcgcgacc agccagaccg ctcccggcat ctgccccgcc gccacctccg ccgccatcac 3060 
ctcgcgcgcg ggcgccagcg cgccggcccc cgcgtcctgc cctggctgcc cctcctcctc 3120 
ggccccaccc aacgcgcacc ccggcgccgc cacgctgatc aaagctccca taaactcccg 3180 
ccttctcatg accgtcgatg cctctccgag cgggggcgcc tgcccctgcc gagagcactg 3240 
actgcccgcg cccgaaaaaa tcatcggtgc cccgtcacga tcgccgccgg gcgtggctcc 3300 
gcccggccgc ccgctcgggc gcccgcccct ggacgagcaa agctcgcccg cccgcgctca 3360 
gcacgccgct tgccatgtcc ggcctgcacc cacaccgagg agccacccac cctgatgcac 3420 
ggcctcaccg agcggcaggt cctgctctcg ctcgtcaccc tcgcgctcat cctcgtgacc 3480 
gcgcgcgcct ccggcgagct cgcgcggcgg ctgcgccagc ccgaggtgct cggggagctc 3540 
ttcggcggcg tcgtgctggg cccctccgtc gtcggcgcgc tcgcgcccgg gttccatcga 3600 
gccctcttcc aggagccggc ggtcggggtc gtgctctcgg gcatctcctg gataggcgcg 3660 
ctcctcctgc tgctgatggc gggcatcgag gtcgacgtgg gcatcctgcg caaggaggcg 3720 
cgccccgggg cgctctcggc gctcggcgcg atcgcgcccc cgctcgcggc gggcgccgcc 3780 
ttctcggcgc tcgtgctcga tcggcccctt ccgagcggcc tcttcctcgg gatcgtgctc 3840 
tcggtgacgg cggtcagcgt gatcgcgaag gtgctgatcg agcgcgagtc gatgcgccgc 3900 
agctatgcgc aggtgacgct cgcggcgggg gtggtcagcg aggtcgctgc ctgggtgctc 3960 
gtcgcgatga cgtcgtcgag ctacggcgcg tcgcccgcgc tggcggtcgc ccggagcgcg 4020 
ctcctggcga gcggattctt gctgttcatg gtgctcgtcg ggcggcggct cacccacctc 4080 
gcgatgcgct gggtggccga cgcgacgcgc gtctccaagg gacaggtgtc gctcgtcctc 4140 
gtcctcacgt tcctggccgc ggcgctgacg cagcggctcg gcctgcaccc gctgctcggc 4200 
gcgttcgcgc tcggcgtgct gctcaacagc gctcctcgca ccaaccgccc tctcctcgac 4260 
ggcgtgcaga cgctcgtggc gggcctcttc gcgcctgtgt tcttcgtcct cgcgggcatg 4320 
cgcgtcgacg tgtcgcagct gcgcacgccg gcggcgtggg ggacggtcgc gttgctgctg 4380 
gcgaccgcga cggcggcgaa ggtcgtcccc gccgcgctcg gcgcgcggct cggcgggctc 4440 
aggggcagcg aggcggcgct cgtggcggtg ggcctgaaca tgaagggcgg cacggacctc 4500 
atcgtcgcga tcgtcggcgt cgagctcggg ctcctctcca acgaggctta tacgatgtac 4560 
gccgtcgtcg cgctggtcac ggtgaccgcc tcacccgcgc tcctcatctg gctcgagaaa 4620 
agggcgcctc cgacgcagga ggagtcggct cgcctcgagc gcgaggaggc cgcgaggcgc 4680 
gcgtacatcc ccggggtcga gcggatcctc gtcccgatcg tggcgcacgc cctgcccggg 4740 
ttcgccacgg acatcgtgga gagcatcgtc gcctccaagc gaaagctcgg cgagacggtc 4800 
gacatcacgg agctctccgt ggagcagcag gcgcccggcc catcgcgcgc cgcgggggag 4860 
gcgagccggg ggctcgcgag gctcggcgcg cgcctccgcg tcggcatctg gcggcaaagg 4920 
cgcgagctgc gcggctcgat ccaggcgatc ctgcgcgcct cgcgggatca cgatctgctc 4980 
gtgatcggcg cgcgatcgcc ggcgcgcgcg cgcggaatgt cgttcggtcg cctgcaggac 5040 
gcgatcgtcc agcgggccga gtccaacgtg ctcgtcgtgg tgggcgaccc tccggcggcg 5100 
gagcgcgcct ccgcgcggcg gatcctcgtc ccgatcatcg gcctcgagta ctccttcgcc 5160 
gccgccgatc tcgcggccca cgtggcgctg gcgtgggacg ccgagctcgt gctgctcagc 5220 
agcgcgcaga ccgatccggg cgcggtcgtc tggcgcgatc gcgagccatc ccgggtgcgc 5280 
gcggtggcgc ggagcgtcgt cgacgaggcg gtcttccggg ggcgccggct cggcgtgcgc 5340 
gtctcgtcgc gcgtgcacgt gggcgcgcac ccgagcgacg agataacgcg ggagctcgcg 5400 
cgcgccccgt acgatctgct cgtgctcgga tgctacgacc atgggccgct cggccggctc 5460 
tacctcggca gcacggtcga gtcggtggtg gtccggagcc gggtgccggt cgcgttgctc 5520 
gtcgcgcatg gagggactcg agagcaggtg aggtgaggct tccaccgcgc tcgcccgtga 5580 
ggaagcgagc gcccggctct gccgacgatc gtcactcccg gtccgtgtag gcgatcgtgc 5640 
tgagcagcgc gttctccgcc tgacgcgagt cgagccgggt atgctgcacg acgatggggg 5700 
cgtccgattc gatcacgctg gcatagtccg tatcgcgcgg gatcggctcg ggttcggtca 5760 
gatcgttgaa ccggacgtgc cgggtgcgcc tcgctggaac ggtcacccgg taaggcccgg 5820 
cggggtcgcg gtcgctgaag taaacggtga tggcgacctg cgcgtcccgg tccgacgcat 5880 
tcaacaggca ggccgtctca tggctcgtca tctgcggctc aggtccgttg ctcccgcctg 5940 
ggatgtagcc ctctgcgatt gcacagcgcg tccgcccgat cggcttgtcc atgtgtcctc 6000 
cctcctggct cctctttggc agcctccctc tgctgtccag gagcgatggc ctcttcgctc 6060 
gacgcgctcg gggatccatg gctgaggatc ctcgccgagc gctccctgcc gaccggcgcg 6120 
ccgagcgccg acgggctttg aaagcgcgcg accggccagc ccggacgcgg gcccgagagg 6180 
gacagtgggt ccgccgtgaa gcagagaggc gatcgaggtg gtgagatgaa acacgtcgac 6240 
acgggccgac gattcggccg ccggataggg cacacgctcg gtcttctcgc gagcatggcg 6300 
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ctcgccggct 
gccgatgcgc 
gacgtcgttc 
tggcagcgtc 
gacagccgaa 
atcaccgccg 
acgtctgtcg 
cctgcctcga 
acggccgcgt 
cctggcgcgg 
cgcgatcttt 
cgccgtgacc 
ctctacgaca 
cccacgctgc 
cgcggcaacg 
aacgtcgtcg 
ggggctggcc 

aaaatttgtc 
ctcgagcgag 
ctcaggaagc 
gggattagat 
aggacgatcg 
tcccatcgag 
cggtggcgtg 
cgggcgagtc 
cccggggaag 
ctccttcttc 
gctggaggtg 
tacggaaacg 
agcgacggcg 
agcgggccga 
ctattcgtcc 
ctccacggcc 
ctcgaagacc 
tgggttcgga 
cgcggacggc 
gagcagcggt 
ggcggacgca 
gacgcttggt 
tgtcgccacg 
gtcggggatc 
gcacctccac 
cacgcgcgcc 
gttcggcatg 
cacaccgccg 
agccctggat 
tctgggcgat 
ggtggcggcg 
gacgtcgccc 
cttcaccgga 
cgcgttccgc 
gctccgcgag 
agccttcacc 
gtggggtgta 
cgtggcgggc 
gatgcaggcg 
ggctgctgcg 
ccaggtggtc 
gcgcggggcg 
cccgatgctg 
cgtcctggtc 
ttgggtgcgc 
ggccggtgcg 
tgcctgcatg 
gccggcgacc 
ggccggcctc 



gcggcggtcc 
gcgtcaccgc 
acctctcgcc 
cgagccccga 
gaggcaagct 
agaagcagag 
ggtgacatcg 
tggccttccc 
ccaagcaacc 
cctgcgcatc 
gttgccatcg 
ggcagctcgg 
acctcaagag 
tggctctgtc 
agaagggccg 
taaccgccca 
ggggtcgacg 
atagatcgta 
ttctctcgga 
ctgatcggga 
cgagtgagac 
acgtccgcga 
cgcgcagccg 
atcgatctga 
cccgccgaac 
acgcccgtta 
ggcatctcgc 
tgctgggagg 
ggagtgttca 
tccgcagaga 
atctcgtatg 
tcgctggtgg 
ctggctggtg 
cgggcgctgg 
cgaggcgaag 
gatcggatat 
ctgaccgtgc 
ggctgcgccg 
gaccccatcg 
ccgctgctga 
actgggctgc 
gcgcaggcgc 
cggacaccgt 
agcgggacca 
gcgccggagc 
gcacaggcgg 
gtggcgttca 
acgtcgaggg 
ggtgcggtgc 
cagggggcgc 
gaggcgttcg 
gtgatgtggg 
cagccggcgc 
gagccggagt 
gtgttctcgc 
ctgccggccg 
gtggcgccgc 
atcgcgggcg 
cgaaccaagg 
gaggcgttcg 
agcaatctga 
cacgcgcgag 
ggcaccttcg 
ccggacgccc 
gtgctcgagg 
ttcccctcag 



gagcgagaaa 
cgacgtcgac 
gcccgagcgg 
gtccccgtgg 
ggccgagacg 
cagccctcag 
cgctatcagc 
catcacccct 
gccgtgccgg 
gctggaagga 
cagccaacgt 
cgaggcgttt 
cgccgtcgtc 
ggcgcattac 
cgtccagcgc 
gcaatgtcat 
atatcgcgcg 
agctgtgata 
taactttcaa 
cgagctaatt 
agttcttttg 
gcgggtcagc 
aagatccgat 
gcgggttctg 
gctgggatgc 
cgcgcgcatc 
ctcgcgaagc 
cgctggagaa 
tcgggatcgg 
tcgacgctca 
ccctcgggct 
ccgttcatct 
gggtatcgct 
ccagggacgg 
ggtgcgccgt 
tggcggtgat 
cgaacgggag 
cgtcttcggt 
aaatccaagc 
tcgggtcggt 
tgaaggtcgt 
tgaacccccg 
ggccggactg 
acgcgcacgt 
gaccggcaga 
cgcggctgcg 
gtctggcgac 
aggggctgcg 
gcagtatcgc 
agacgctggg 
acctgtgcgt 
ccgaaccggc 
tgttcacctt 
tggtcgccgg 
ttgaggacgc 
gcggggcgat 
acgcagcgtc 
ccgggcaacc 
cgctccacgt 
ggcgtgtggc 
gcgggaaggc 
aggtggtgcg 
tcgaggtcgg 
ggccggcgct 
cgctcggcgg 
gggggcggcg 



accgtgcagg 
cccgacgccg 
ctcgaggccg 
cgacgggtcg 
accgtgccgt 
tcgccatcgt 
agcgctgagc 
gcgcactcct 
cgcggctcca 
tcggcggagc 
ggcgatcgcg 
gccgacttcg 
gagcgccacg 
cgcttcgagc 
gccatcacgg 
gggaatggcc 
atctccccgt 
gtggtctgtc 
tttttccgag 
cccatccatt 
cagtgcgcga 
cgctgaggat 
tgcgatcgtc 
gacgctcctc 
agcagcgtgg 
tttcctgagc 
gctgcggatg 
cgccgcgatc 
cccgtccgaa 
tggcgggctg 
gcgagggccg 
ggcctgtcag 
gatgttgtcg 
tcgctgcaag 
cgtggtcctc 
tcgaggatcc 
ctcccaagaa 
gggttatgtc 
tctgaatgcg 
gaagaccaac 
cttgtccctt 
gatctcatgg 
gaatacgccg 
ggtgctggaa 
gctgctggtg 
cgaccatctg 
gacgcgcagc 
ggcagccctg 
cgattcctca 
catgggccgt 
gaggctgttc 
cagcgtcgac 
cgaatatgcg 
ccatagcatc 
ggtgttcctg 
ggtgtcgatc 
ggtgtcgatc 
cgtgcatgcg 
ctcgcatgcg 
cgagtcggtg 
ttgcacagac 
cttcgcggat 
tccgaaatcg 
gctcgcatcg 
gctctgggcc 
ggtgccgctg 



gcacgcggct 
cgaccacgcg 
gcagcgagcg 
gagtgctcga 
atgccaactt 
ctgccgccgt 
ccgccagcag 
ccagcgacgg 
cgcgcgcgac 
atggatagag 
gcggtcaagt 
gcggcgtccc 
gcgacgcgat 
cgcgccccgt 
cgtggacgac 
ccttgaaatg 
caattcccga 
ttacgttgcg 
gggggcttgg 
tttttgaggc 
agaacctggg 
gtgcccgtcg 
ggagcgagtt 
gagggctcgc 
tttgatcccg 
gacgtagcct 
gaccctgcac 
gctccatcgg 
tatgaggccg 
gggacgatgc 
tgtgtcgcgg 
agcttgcgct 
ccgagcaccc 
gcattttcgg 
aagcggctca 
gcgatcaatc 
atcgtgctga 
gaggcacacg 
gtatacggcc 
cttggccatc 
cagcacgggc 
ggtgatcttc 
cgacgggcgg 
gaggcgccgg 
ctgtcggcaa 
gagacctacc 
gcgatggagc 
gacgctgcgg 
cgcggcaagc 
gggctgtacg 
aaccaggagc 
gccgcgctgc 
ctcgccgcgc 
ggtgagctgg 
gtggctgcgc 
gaggcgccgg 
gccgcggtca 
atcgcggcgg 
ttccactcac 
agctaccggc 
gaggtgagct 
ggagtgaagg 
acgctgctcg 
tcgcgcgctg 
gtcggtggcc 
cccacgtacc 



cgcgcccggc 
gctggcggtg 
gttcgtcgtc 
ctacaatgct 
cgagctgctc 
catcgggccg 
gccccagggc 
ccgcgcagcg 
aggcgagcgt 
aatcgaggat 
tcatcgccgc 
gcgcgtgctg 
ccggttccac 
cgccgtcgcc 
atggcgcgga 
gccccttgag 
tggtaaaaga 
tcttccgcac 
tctctggttc 
tctgctcaaa 
cctcgaccgg 
tggcggatcg 
gccgtctgcc 
gcgacaccgt 
accccgatgc 
gcttcgacgc 
atcgactctt 
cgctcgtcgg 
cgctgccgca 
ccagcgtcgg 
tggatacggc 
ccggggaatg 
tcgtgtggct 
cggaggccga 
gtggagcccg 
acgacggtgc 
aacgggccct 
gcacgggcac 
tcgggcgaga 
ctgagtatgc 
agattcctgc 
ggctgaccgt 
gggtgagctc 
cggcgacgtg 
ggaccgcgtc 
cttcgcagtg« 
accggctcgc 
egcagggaca 
tcgcctttct 
atgtatggtc 
tcgaccggcc 
tcgaccagac 
tgtggcggtc 
tggctgcctg 
gcgggcgcct 
aggccgatgt 
acgctccgga 
cgatggccgc 
cgctcatggc 
ggccgtcgat 
cgccgggcta 
cgctgcacgc 
gcctggtgcc 
ggcgtgacga 
tggtctcctg 
cttggcagcg 



6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
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cgagcgctac tggatcgaca cgaaagccga cgacgcggcg cgtggcgacc gccgtgctcc 10320 
gggagcgggt cacgacgagg tcgaggaggg gggcgcggtg cgcggcggcg accggcgcag 10380 
cgctcggctc gaccatccgc cgcccgagag cggacgccgg gagaaggtcg aggccgccgg 10440 
cgaccgtccg ttccggctcg agatcgatga gccaggcgtg cttgatcacc tcgtgcttcg 10500 
ggtcacggag cggcgcgccc ctggtctggg cgaggtcgag atcgccgtcg acgcggcggg 10560 
gctcagcttc aatgatgtcc agctcgcgct gggcatggtg cccgacgacc tgccgggaaa 10620 
gcccaaccct ccgctgctgc tcggaggcga gtgcgccggg cgcatcgtcg ccgtgggcga 10680 
gggcgtgaac ggcctcgtgg tgggccaacc ggtcatcgcc ctttcggcgg gagcgtttgc 10740 
tacccacgtc accacgtcgg ctgcgctggt gctgcctcgg cctcaggcgc tctcggcgat 10800 
cgaggcggcc gccatgcccg tcgcgtacct gacggcatgg tacgcgctcg acagaatagc 10860 
ccgccttcag ccgggggagc gggtgctgat ccatgcggcg accggcgggg tcggtctcgc 10920 
cgcggtgcag tgggcgcagc acgtgggagc cgaggtccat gcgacggccg gcacgcccga 10980 
gaaacgcgcc tacctggagt cgctgggcgt gcggtatgtg agcgattccc gctcggaccg 11040 
gttcgtcgcc gacgtgcgcg cgtggacggg cggcgaggga gtagacgtcg tgctcaactc 11100 
gctctcgggc gagctgatcg acaagagttt caatctcctg cgatcgcacg gccggtttgt 11160 
ggagctcggc aagcgcgact gttacgcgga taaccagctc gggctgcggc cgttcctgcg 11220 
caatctctcc ttctcgctgg tggatctccg ggggatgatg ctcgagcggc cggcgcgggt 11280 
ccgtgcgctc ttggaggagc tcctcggcct gatcgcggca ggcgtgttca cccctccccc 11340 
catcgcgacg ctcccgatcg cccgtgtcgc cgatgcgttc cggagcatgg cgcaggcgca 11400 
gcatcttggg aagctcgtac tcacgctggg tgacccggag gtccagatcc gtattccaac 11460 
ccacgca.ggc gccggcccgt ccaccgggga tcgggacctg ctcgacaggc tcgcgtcagc 11520 
tgcgccggcc gcgcgcgcgg cggcgctgga ggcgttcctc cgtacgcagg tctcgcaggt 11580 
gctgcgcacg cccgaaatca aggtcggcgc ggaggcgctg ttcacccgcc tcggcatgga 11640 
ctcgctcatg gccgtggagc tgcgcaatcg tatcgaggcg agcctcaagc tgaagctgtc 11700 
gacgacgttc ctgtccacgt cccccaatat cgccttgttg gcccaaaacc tgttggatgc 11760 
tctcgccaca gctctctcct tggagcgggt ggcggcggag aacctacggg caggcgtgca 11820 
aaacgacttc gtctcatcgg gcgcagatca agactgggaa atcattgccc tatgacgatc 11880 
aatcagcttc tgaacgagct cgagcaccag ggtatcaagc tggcggccga tggggagcgc 11940 
ctccagatac aggcccccaa gaacgccctg aacccgaacc tgctcgctcg aatctccgag 12000 
cacaaaagca cgatcctgac gatgctccgt cagagactcc ccgcagaatc catcgtgccc 12060 
gccccagccg agcggcacgc tccgtttcct ctcacagaca tccaagaatc ctactggctg 12120 
ggccggacag gagcgtttac ggtccccagc gggatccacg cctatcgcga atacgactgt 12180 
acggatctcg acgtgccgag gctgagccgc gcctttcgga aagtcgtcgc gcggcacgac 12240 
atgcttcggg cccacacgct gcccgacatg atgcaggtga tcgagcctaa agtcgacgcc 12300 
gacatcgaga tcatcgatct gcgcgggctc gaccggagca cacgggaagc gaggctcgtg 12360 
tcgttgcgag atgcgatgtc gcaccgcatc tatgacaccg agcgccctcc gctctatcac 12420 
gtcgtcgccg ttcggctgga cgagcggcaa acccgtctcg tgctcagtat cgatctcatt 12480 
aacgttgacc taggcagcct gtccatcatc ttcaaggact ggctcagctt ctacgaagat 12540 
cccgagacct ctctccctgt cctggagctc tcgtaccgcg attatgtact cgcgctggag 12600 
tctcgcaaga agtctgaggc gcatcaacga tcgatggatt actggaagcg gcgcatcgcc 12660 
gagctcccac ctccgccgac gcttccgatg aaggccgatc catctaccct gaaggagatc 12720 
cgcttccggc acacggagca atggctgccg tcggactcct ggggtcgatt gaagcggcgt 12780 
gtcggggagc gcgggctgac cccgacgggc gtcatcctgg ctgcattttc cgaggtgatc 12840 
gggcgctgga gcgcgagccc ccggtttacg ctcaacataa cgctcttcaa ccggctcccc 12900 
gtccatccgc gcgtgaacga tatcaccggg gacttcacgt cgatggtcct cctggacatc 12960 
gacaccactc gcgacaagag cttcgaacag cgcgctaagc gtattcaaga gcagctgtgg 13020 
gaagcgatgg atcactgcga cgtaagcggt atcgaggtcc agcgagaggc cgcccgggtc 13080 
ctggggatcc aacgaggcgc attgttcccc gtggtgctca cgagcgcgct taaccagcaa 13140 
gtcgttggtg tcacctcgtt gcagaggctc ggaactccgg tgtacaccag cacgcagact 13200 
cctcagctgc tgctggatca tcagctctac gagcacgatg gggacctcgt cctcgcgtgg 13260 
gacatcgtcg acggagtgtt cccgcccgac cttctggacg acatgctcga agcgtacgtc 13320 
gtttttctcc ggcggctcac tgaggaacca tggggtgaac aggtgcgctg ttcgcttccg 13380 
cctgcccagc tagaagcgcg ggcgagcgca aacgcgacca acgcgctgct gagcgagcat 13440 
acgctgcacg gcctgttcgc ggcgcgggtc gagcagctgc ccatgcagct cgccgtggtg 13500 
tcggcgcgca agacgctcac gtacgaagag ctttcgcgcc gttcgcggcg acttggcgcg 13560 
cggctgcgcg agcagggggc acgcccgaac acattggtcg cggtggtgat ggagaaaggc 13620 
tgggagcagg ttgtcgcggt tctcgcggtg ctcgagtcag gcgcggccta cgtgccgatc 13 680 
gatgccgacc taccggcgga gcgtatccac tacctcctcg atcatggtga ggtaaagctc 13740 
gtgctgacgc agccatggct ggatggcaaa ctgtcatggc cgccggggat ccagcggctg 13800 
ctcgtgagcg aggccggcgt cgaaggcgac ggcgaccagc ctccgatgat gcccattcag 13860 
acaccttcgg atctcgcgta tgtcatctac acctcgggat ccacagggtt gcccaagggg 13920 
gtgatgatcg atcatcgggg tgccgtcaac accatcctgg acatcaacga gcgcttcgaa 13980 
atagggcccg gagacagggt gctggcgctc tcctcgctga gcttcgatct ctcggtctat 14040 
gatgtgttcg ggatcctggc ggcgggcggt acgatcgtgg tgccggacgc gtccaagctg 14100 
cgcgatccgg cgcattgggc agagttgatc gaacgagaga aggtgacggt gtggaactcg 14160 
gtgccggcgc tgatgcggat gctcgtcgag cattttgagg gtcgccccga ttcgctcgct 14220 
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aggtctctgc 
ctccaggcca 
atctggtcca 
ggccgtccgc 
gtctgggttc 
gatgaagaga 
aagaccggcg 
gacaaccaaa 
aagtcgcatc 
aagctccttc 
gcgagcctca 
agcgacggcg 
ggaaagcccg 
gcgcgtcgcc 
cgattcctga 
tatccatcgg 
atcgagggcg 
ctctccgatc 
gaagcggcgt 
tcgtcgtcgc 
caggcgcctt 
cggccggttc 
gtagacccgc 
acgacgcgcg 
ttgaggacca 
ccgctgacgt 
tcgccgcggc 
gtcgtacggg 
ggtgcgacat 
gagatcgcca 
cgccgagact 
cgcaagggca 
gccgcaagcc 
gggtgtgcgc 
aagagtcctc 
tggacgaatt 
aggagctcgc 
cgggcagcgt 
gcgaggcaga 
tggagaacgc 
ccaacatgag 
ccggctggtt 
acaggctgaa 
tggcggttca 
gcgggattac 
tctctcccga 
acggctgcgg 
tccgcgcggt 
ctgcgcccag 
tcgaggcccg 
ccatcgagac 
gcgcgatcgg 
gtttgatcaa 
agtctcctaa 
aggattggaa 
gcaccaacgc 
cggcgcgctc 
cggcggcacg 
ccttcagcct 
cgcgcgaggc 
ccgtgcgtgg 
agggctctca 
cggcgctttc 
agctcgccgc 
tgttcgccct 
tcgtgatcgg 



ggctttcgct 
tcaggcccgg 
tcgggtaccc 
tgcgcaacca 
cggggcaact 
agacgcgcaa 
atctgggccg 
tcaagcttcg 
cgaacgtacg 
tagcctatgt 
agaccgagcg 
agagggtgca 
tcgtcgatct 
gtagcgtccg 
gctgcttgag 
cgggcagcac 
tggacgaggg 
acgggatcga 
tcaacctcct 
gagaattttg 
cctgcaacat 
tcgacctgcg 
ggcagttcca 
gcgcccctcc 
aactacccga 
ccaacggcaa 
attcggggca 
aggtgctcgg 
cgattcacat 
tcaccgagtt 
cgagagatct 
ggagacgtag 
cgcctgcgtc 
gttgagccgt 
cgctatcgca 
ctggaggaac 
ggcgtccgga 
gctggaagac 
gctcatggat 
cggatacgac 
ctcgtacttg 
tcagacgttg 
tctgagaggg 
cttggcgtgc 
cgtccggatc 
cggccattgc 
cgttgtcctc 
tatccttggg 
tgaggtgggc 
gtccatccaa 
ggcggcgctg 
ctccgtgaag 
gacggtcttg 
cccatcgatc 
taccggctcg 
ccatgtcgtg 
tgccgagctc 
gctacgagat 
ggcgacgacg 
gttgcgagag 
ccgctgctcc 
gtgggtcggc 
ggcgtgcgac 
cgacgaaggg 
cgcggtggca 
ccacagcatg 



gctgagcggc 
cgtgtcggtg 
cgtgaggaac 
gacgttccac 
ctacattggc 
gagcttcctc 
ctacctgccc 
cggataccgc 
cgacgcggtg 
ggtcccggag 
gatcgacgcg 
gttcaagctc 
gaccgggcag 
aacgttcctt 
cagcgtggag 
gtacccggtg 
cttctattat 
gcgcggagcg 
gttcgtgggc 
cctgctggag 
cggcgtctgt 
acattcggac 
ggtctgtacg 
cggccgcgag 
gtacatggtg 
ggtcgatcgt 
cacggcgcca 
gctggaggtg 
cgttcgcatg 
gttccagtac 
agatcagcgg 
ctaagagcgc 
accctgggac 
gttgttcgaa 
gtcatcggca 
cttcgagacg 
gtcgaccccg 
gtcgaccggt 
ccgcagcacc 
ccgacggctt 
acgtcgaacc 
atcggcaacg 
ccgagcatct 
atgagcctcc 
ccccatcgag 
cgggccttcg 
ctgaagccgc 
tctgccacaa 
caggcgcaag 
tacatcgaga 
cggcgggtgt 
accggcatcg 
gcgctggagc 
gatttcgcga 
actccgcggc 
ctggaggaag 
ttcgtcgtct 
catctgcagg 
cgcagcccca 
gggctcgacg 
ccaggcaacg 
atgggccggc 
cgggccatcc 
tcctcccagc 
tttgcggcgc 
ggcgaggtag 



gactggatcc 
atcagcctgg 
gtcgacctat 
gtgctcgatg 
ggggtcgggc 
gtgcaccccg 
gatggaaaca 
gttgagctcg 
attgtgcccg 
ggcacacgga 
agagcacacg 
gctcgacacg 
gatccgcggg 
gaggccccga 
cccgacggcg 
caaacctacg 
taccacccgt 
cacgttcggc 
aggatcgacg 
gccggatata 
ccggtggggc 
gtttacgtgc 
ctcggtcagg 
cagcacttcg 
cctacagtct 
aaggccctgc 
cgggacgcct 
gtcgggctcc 
aggagcctgt 
ccgaacctcg 
ccgaacatgc 
cgaacaaaac 
tcatctgatc 
cgctgaggaa 
tgtcgggccg 
gcacggaggc 
cgctggtgct 
tcgacgctgc 
ggatcttcat 
acgagggctc 
tccacgagca 
acaaggatta 
ccgttcaaac 
tggaccgcga 
ccggctatgt 
acgccaaggc 
tggaccgggc 
acaacgacgg 
cgatcatgga 
cccacgggac 
tcggtcgcga 
gacacctcga 
accggcagct 
gcagcccgtt 
gggccggcgt 
cgcccgcggc 
cggccaagag 
cgcaccaggg 
tggagcaccg 
cagcggcgcg 
tgccgaaggt 
agctcctggc 
aggccgaagc 
tcgagcgcat 
tgtggcggtc 
ccgccgcgca 



cggtgggcct 
gcggggccac 
cgtgggcgag 
aggcgctcga 
tggcactggg 
agaccgggga 
tcgagttcat 
gggaaatcga 
tcgggaacga 
gacgcgctgc 
ccgccgaagc 
gactccggag 
aggcggggct 
ttccgtttgt 
cgacccttcc 
cgtatgtcaa 
tcgagcaccg 
aaaacttcga 
ccatcgagtc 
tggcgcagct 
aattcaattt 
acggcatgct 
attcctcacc 
ccgatatgct 
tcgtggagct 
gcgagcggaa 
tggaggagat 
agcagagctt 
tgcagaagag 
gctcgctggc 
aggaccgagt 
caggccgagc 
tgatcgcggg 
cggtgagctc 
ttttccgggg 
cgtgcagcgc 
ggacccgagc 
tttcttcggc 
ggaatgcgcc 
tatcggcgtg 
cccagcgatg 
cctcgcgacc 
tgcctgctcc 
gtgcgacatg 
atatgctgag 
gaacggcacg 
gctctccgat 
agcgaggaag 
ggcgctggcg 
cggcacgctg 
cgcttcggcc 
atcggcggct 
gccgcccagc 
ctacgtcaat 
cagctcgttc 
gaagcttcca 
cgcagcggcg 
gatttcgttg 
gctcgcgatg 
aggccagacc 
ggtcttcgtc 
tgaggaaccc 
tggttggtcg 
cgacgtggtg 
gtggggtgtc 
tgtggccggg 



gcctggcgag 
cgaagcgtcg 
catcccctac 



accgcgcccg 
ctactggcgc 
gcgcctctac 
ggggcgtgag 
ggaaacgctc 
cgcggcgaac 
cgagcaggac 
ggacggcttg 
ggacctggac 
ggacgtctac 
tgagtttggt 
caaattccgt 
atccggccgc 
tttgctgaag 
cgtgttcgat 
gctgtatgga 
cctgatggag 
tgaacaggtt 
gggcgggcgg 
gaggcgcgcc 
tcgcgacttc 
cgatgcgttg 
ggatacctcg 
cctcgtcgcg 
cgtcgatctt 
gctggatagg 
gtccggtttg 
ggaggttcgg 
gggccgatga 
tacgcgtcgc 
atggaagaac 
gcgcgggatc 
ttctccgagc 
tacgtccggg 
atcagcccgc 
tgggaggcgc 
tacgccggcg 
atgcggtggc 
cacgtctcct 
acctcgctcg 
gcgctggccg 
gggggcatct 
atcatgggca? 
ggtgatcccg 
atcgggttca 
ctggcagggg 
ctcggagacg 
cggaggtctt 
ggcatcgccg 
ctgaacttcg 
acctctctta 
gggatcggcg 
gccgcggcgc 
ctggatgccg 
ggcgacgtcg 
gcggcgccgt 
ccgccgggcg 
tttcccggcc 
gtcttccacg 
ctgctcgcgg 
cagccggtgc 
gcgpccgacg 
gcgctgtcgc 



14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
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tcgaggatgc 
agggcgagat 
acgaggatcg 
agccggcagc 
gggtgaaggt 
tggcagccct 
cgggcgccat 
agccagtgcg 
tggagatgag 
agcgggcggg 
tggaggcgct 
ccgcgggggg 
tcgaagcgcc 
cgctcctcgg 
cgctggatct 
ttccgggcgc 
gccctttgca 
cggtgttggt 
cgagccgggc 
tccgagtgga 
tccaggccag 
acggccctgc 
gggtacgcct 
tggacgcgtg 
gggtgcccgt 
gccatgcgcg 
gggtggtcga 
ttccgggagg 
ccgcagcggt 
gtgggctcgg 
cggcagagaa 
gccaggctcc 
cagggctcgg 
ccctcgatcc 
ccggcatggg 
tcggcgccgg 
ccatggagca 
gggagctcgc 
tgcgcggtgg 
gggggaggat 
ttgtgaccgg 
gcgctggtca 
ccgtcgcggc 
atcgggcgca 
gcgtcgtcca 
ggtttcgtaa 
gcgaagcgcc 
cgggccaggg 
gggcgcaggg 
cggccgcgca 
ccgacgaggg 
tgatgccggt 
tgttgtcgcg 
acctgctccg 
tcctccgcgc 
ccccgctcac 
aggccatgct 
cgctgagcgg 
ccaccgccga 
tcgcagcaaa 
gctgaaacaa 
ggcggagctg 
cggtgcggac 
gccgctcgac 
ggcggggctg 
tcgggaggcg 



ggtggcgatc 
ggcggtgacc 
ggtgagcgtg 
gatcggcgag 
ggatgtcgcc 
gggcgggctc 
ggtagcgggc 
cttcgccgag 
cccgcatccg 
cgcagcggtg 
gggcacgctg 
gcggcgggta 
ggccaagagc 
tgaaatgcag 
caagcggctg 
ggcgtacctg 
gataactgac 
ccaggtggtg 
gccgggcgct 
gcgcaccgag 
catacccgcc 
cttccagggg 
gcccgacgcg 
cttccagatc 
ggagttgggc 
cgtcgtgaac 
cagctcgggt 
ggtgcgccgg 
cggcacagcc 
cgccgcgttg 
caacacgagc 
gacggcggtg 
ggcgcaaggc 
ggcgctggta 
ctttcgagac 
cgacgtctcc 
cgcggatctg 
tgccctgctg 
cgagcgatgc 
cgagagctgc 
cggtctgggt 
cctggtgctg 
gctcgaggcc 
gctcgagcgg 
tgcggccggc 
ggtgatggcg 
gctttccttc 
caactacgcc 
gctgccagcg 
ggaagatcgc 
gctgtccgct 
gaacccgcgg 
cctggtgacg 
ccgcctcgcc 
gcagatctcg 
gagcctgggc 
gggcatcacc 
gcatctggcg 
ctctgccgtc 
attcaaggcg 
gcggccatca 
gaacggaccg 
gctccggaag 
atgcgctggg 
ctcaccgagc 
cgatcgctcg 



atctgccggc 
gagctgtcgc 
gccgtgagca 
gtgctgtcgt 
agccacagcc 
cggccgggtg 
ccggagctcg 
gtagtccagg 
atcctaacga 
ggctcgctgc 
tgggcgcagg 
ccgctgccga 
gccgcgggcg 
accctgtcaa 
ccgtggctcg 
gagatggcga 
gtggtgctcg 
acgacggagc 
ggccacgcgt 
gtcccggctg 
gcggccacct 
attgctgagc 
gccggctcgg 
gtcggcagcc 
tcgctgcggc 
catgggcacc 
gcagtggtcg 
cgcgaagaag 
aaggtcaacg 
cgcgcgatgc 
gctgccggcg 
gtgcacctcg 
gcattggacg 
cgtggctgcg 
gccccgcgat 
gtgacacagg 
cgctgcgctc 
gccgagctgc 
gtcgctcgga 
gttccgaccg 
gggctcggtc 
gtgggccgct 
cgcggcgcgc 
atcctccgcg 
atcttggacg 
cccaaggtcc 
ttcgtgctgt 
gcggccaaca 
ttgagcgtcg 
ggcgcgcggc 
ctggcacggc 
ctgtgggtgg 
gcgcatcgcg 
gctgccgagc 
caggtgctgc 
atgaactcgc 
gtaccggcaa 
cgggaggcat 
gagatcgagg 
cttacatgac 
tcattcagcg 
agccgatcgc 
cgttttggga 
cgctggtggg 
cgatagattg 
acccgcagca 



gcagccggct 
tggccgaggc 
acagcccgcg 
ccctgaacgc 
cgcaggtcga 
cggctgcggt 
gagcgaatta 
cgcagctcca 
cttcggtcga 
ggcgggggca 
gctaccctgt 
cctatccctg 
atcgccgcgg 
cccagacgag 
gcgaccaccg 
tttcgtcggg 
ccgaggcgct 
agccgtcggg 
ccttccgggt 
ggcttacgct 
acgcggagct 
tatggcgggg 
cagcggagta 
tcttcgcccg 
tcttgcagcg 
aaacccccga 
ccgaagtttg 
acgattggtt 
cgggccggtg 
tggaggccgg 
tacgcgcgct 
gcagcctcga 
cgccccggag 
acagcgtgct 
tgtggctttt 
caccgctgct 
gggtcgacct 
tggccgacga 
tcgtccgccg 
acgtcaccat 
tgagcgtggc 
ccggcgcggc 
gcgtcaccgt 
aggttaccac 
acgggctgct 
agggggcctt 
acgcttcggg 
cgttcctcga 
actggggcct 
tggtctcccg 
tgctcgaaag 
agctctaccc 
cgagcgccgg 
cgagcgcgcg 
gcctccccga 
tgatggggct 
cgctgttgtg 
gcgaagccgc 
agatgtcgca 
tactcgcggt 
gctggaggag 
catcgtcggt 
gctgctcgac 
tgtcgctccc 
cttcgatgct 
tcgtctgttg 



gctccggcgc 
cgaggcggcg 
ctcgacggtg 
gaagggggtg 
cccgctgcgc 
gccgatgcgc 
ctggatgaac 
aggcggccac 
ggagatgcgg 
ggacgagcgc 
accctggggg 
gcagcgcgag 
cgtgcgtgcg 
cacgcggctg 
ggtgcaggga 
ggccgaggct 
ggccttcgcg 
gcggctgcag 
ccacgctcgc 
ttccgctgtg 
gaccgagatg 
tgaaggcgag 
tcggttgcat 
cagtggcgag 
gccttcgggg 
tcggcagggc 
cgggctcgtg 
cctggagctc 
gctgctcctc 
cggccatgcc 
cctggcaaag 
tgggggtggc 
cgccgacgtc 
ctggaccgtg 
gacccgcggc 
ggggctgggc 
cgatccagcc 
cgccgaagcg 
gcagcccgag 
ccgcgcggac 
cggatggctg 
gagcgtggag 
ggcgaaggcg 
gtcggggatg 
gatgcagcag 
gcacctgcac 
agtagggctc 
cgctctggcg 
gttcgcggag 
cggaatgcgg 
cggccgcgct 
cgcggcggcg 
cgggccagcc 
gagcgcgctc 
gggcaagatc 
cgagctgcgc 
gacctatccc 
tcctgtggag 
ggacgatctg 
cctacggcac 
cggctcgctg 
atcggctgcc 
gcggagcgcg 
gtcgaggccg 
gcgttcttcg 
ctggaggtcg 



atcagcggtc 
ctccgaggct 
ctctcgggcg 
ttctgccgtc 
gaggacctct 
tcgacggtga 
aacctcaggc 
ggtctgttcg 
cgcgcggccc 
ccggcgatgc 
cggctgtttc 
cggtactgga 
ggcggtcacc 
tgggagacga 
gcggtcgtgt 
ttgggcgatg 
ggcgacgcgg 
ttccagatcg 
ggcgcgttgc 
cgcgcgcggc 
gggctgcagt 
gcgctgggac 
cctgcgctgc 
gcgacgccgt 
gagctgtggt 
gccgactttt 
gcgcagcggc 
gagtgggaac 
ggcggcggcg 
gtcgtgcatg 
gcctttgacg 
gagctcgacc 
agtcccgatg 
caggccctgg 
gcacaggccg 
cgcgtcatcg 
cggcccgagg 
gaagtcgcgt 
acccggcccc 
agcacctacc 
gccgagcgcg 
caacgggcag 
gatgtcgccg 
ccgctgcggg 
actcccgcgc 
gcgttgacgo 
ttgggctcgc 
caccaccgga 
gtgggcatgg 
agcctcaccc 
caggtggggg 
tcttcgcgaa 
ggggacgggg 
ctggagccgc 
gaggtggacg 
aaccgcatcg 
acggtggcgg 
tcaccgcaca 
acgcagttga 
agcagaatcc 
ggctcgcaca 
gcttccctgg 
acgcggtcca 
tgccgcactg 
gcatctcgcc 
cttgggaggg 



18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
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gctcgaggac 
cggcgctttc 
cgcgtacagc 

ggggttgcag 

tcacctcgcc 

cagcgcgctc 

cgatggtcgt 

tggcctggtc 

gctgatccgg 

cgtgctggct 

ggccgtcgat 

cgaggcgctg 

cgcggtgaag 

ggcagcgctt 

tccgcggatc 

gcgcacggac 

gcatgtggtg 

ggcggagctt 

gctgcgcgag 

ggcgacgacg 

gctgctggcg 

catcgcgagc 

gccgggcatg 

gtgcgtggcg 

ggcggggagc 

cgcggtggag 

ggttgggcat 

agatggggtg 

cgcgatggtg 

ggcgtcggtg 

gcaagcggtg 

gcatgtctcg 

ggtggcggcg 

gaaggtggtc 

ggtgcgcttc 

agtgggcccg 

gacgctgctg 

gggcaggctg 

gcggcgggtg 

ggccgaaggg 

ctggcccgag 

ggtgctggcc 

atgttcgtgc 

ccaggccctc 

cgtcgtggag 

gccggtgctc 

cgtgacccga 

gctgtggggt 

ggacctggat 

gccggacgcc 

ggccgcccca 

ggtgacgggt 

ggcggggcac 

agatcagccg 

cgcgcgggtc 

ggcggccgtc 

gctgctggcc 

ggcatgggtg 

ctcggcgtcg 

tttggacgcg 

gggcctgtgg 

gggaatctgg 

gcgcgcgacg 

cgacgcgagc 

ctcggccgtg 

gctctacgag 



gccggtatcc 

acggcggact 

gccaccggca 

ggaccttgcc 

tgccgcagcc 

ctctcccccg 

tgccggacct 

gtcctcaaac 

ggctcggcca 

caggagacgg 

tacgtcgaga 

cgggcgacgg 

accaacatcg 

tcgctgacgc 

cggctcgagg 

cgcccgcgct 

ctggaagagg 

ttggtgctgt 

cacctggaca 

cgcagcgcga 

gcgctctcgg 

tcctcgcgcg 

ggccgggggc 

ctgttcgacc 

gccgagtcgt 

tacgcgctga 

agcatcgggg 

aggctcgtgg 

tcgctcggag 

tcgatcgcgg 

caggcgatcg 

cacgcgttcc 

tcggtgacgt 

acggacgagc 

gcggacgggg 

aagccgacgc 

gcgtcgttgc 

tgggccgccg 

ccgctgccga 

ctcggagcca 

atgcctcgct 

gaccggggtg 

gccgtgctcc 

ggtggccgca 

gcgggggcat 

gcgctgattc 

ggggcctgca 

atgggccggg 
ccggaggaga 
gaggatcagc 
ccggagggaa 
gggctgggcg 
cttgtgctga 
ccagaggtgc 
accgtggcgg 
gagccgccgc 
caccaggacg 
ctgcacaccc 
ggcgtcttcg 
ctggcggacc 
gcggaggggg 
gcgatgccga 
cagcgcgtgg 
cgaggccgct 
ccagctgtgg 
cttgtgcgcg 



cgccccggtc 

acgcgcgcac 

acatgctcag 

tgaccgtcga 

tgcgcgcagg 

acatgatgga 

tcgatgcttc 

ggctctccga 

tcaaccatga 

tcttgcgcga 

cccacggaac 

tggggccggc 

gccatctcga 

acgagcgcat 

gcagcgcgct 

tcgcgggggt 

cgccggcggt 

cgggcaagag 

tgcacccgga 

tgagccaccg 

ccgtggcgca 

gcaagctggc 

tttgcgcggc 

gggagctgga 

tgttgctcga 

cggcgctgtg 

agctggtggc 

cggcgcgcgg 

cgccggaggc 

cggtcaatgg 

cggcggggtt 

actcgccgct 

accggcggcc 

tgagcgcgcc 

tgaaggcgct 

tgctcgggct 

gcgccgggcg 

gcggctcggt 

cctatccgtg 

cggccgccga 

catccgtgga 

gagtcgggga 

atgcgcccgc 

acgactggca 

cggccgaaga 

aggcgctcgg 

cggtgggcgg 

tcgcggcgct 

gcccgacgga 

tggcattccg 

acgcagcgcc 

cccttggcct 

tcagccggca 

gcgcgcgcat 

cggtcgacgt 

tgcggggggt 

ctggtcggct 

ttacccgcga 

gctcgatcgg 

tccgccgaac 

ggatgggctc 

cgagtcgggc 

tcatccagat 

tctgggatcg 

agcgctggcg 

gcgtggtcgc 



catcgacggg 
ggtcgctcgg 
catcgccgcc 
cacggcgtgc 
agagagcgat 
agccgcggcg 
ggccaacggg 
cgcgcaacgg 
tggccggtcg 
ggcgctgcgg 
agggacctcg 
gcgctccgac 
ggccgcggca 
cccgagaaac 
cgcgttggcg 
gagctcgttc 
ggagctgtgg 
cgagggggcg 
gctcgggctc 
gctcgcggtg 
ggggcagacg 
gttcctgttc 
gtggccagcg 
ccgcccgctg 
ccagacggcg 
gcggtcgtgg 
ggcgtgcgtg 
gcggctgatg 
ggaggtggcg 
gccggagcag 
cgcggcgcgc 
gatggaaccg 
aagcgtttcg 

ggggtactgg 

gcacgaagcc 
gttgccagcc 
cgaggaggct 
cagctggccg 
gcagcggcag 
tgcgctggcg 
ttcgcggcga 
ggcggccgcg 
cgaggcctcc 
gggggtgctg 
ggtcgccaaa 
cacggggccg 
cgagcctgac 
agagcatccc 
ggtcgaggcc 
ccaggggcgc 
ggtgtcgctg 
cctcgttgcg 
cggattgccc 
tgcggcgatc 
ggccgatgcc 
agtgcacgcc 
cgcccgggtg 
gcagccgctg 
ccagggcagc 
gcaggggctc 
gcaggcgcag 
cctggcggcg 
ggattgggcc 
gctggtaact 
caacgcgtct 
cggggtgatg 



agccgcaccg 

ctgccgcgcg 

ggacggctgt 

tcgtcatcgc 

ctcgcgttgg 

cgcacgcaag 

ttcgtccgtg 

gatggcgacc 

accgggttga 

agcgcccacg 

ctgggcgatc 

ggcacacgct 

ggcgtagcgg 

ctcaacttcc 

accgagccgg 

gggatgagcg 

cctgccgcgc 

ctcgatgcgc 

ggggacgtgg 

gcggtgacgt 

ccggcggggg 

accggacagg 

ttccgggagg 

cgcgaggtga 

ttcacccagc 

ggcgtagagc 

gcgggggtgt 

caggggctct 

gcggcggtgg 

gtggtgatcg 

ggcgcgcgca 

atgctggagg 

ctggtgagca 

gtgcggcacg 

ggcgcgggga 

tgcctgccgg 

gcgggggtgc 

ggcgtcttcc 

cggtactgga 

cagtggttct 

gcccggtccg 

gcggcgcttt 

gcggttgccg 

tacctgtggg 

gtcacccatc 

cgctcacccc 

gctgccccct 

ggctcctggg 

ctggtggccg 

cggcgcgcag 

tctgcggagg 

cggtggttgg 

gaccgcgagg 

gaggcgctgg 

gaaggcatgg 

gcgggtctgc 

ttgcgcccca 

gacctcttcg 

tacgcggcag 

gccgccctga 

cgccgggaac 

atggaatggc 

catgcgggag 

gccacgaaag 

gttgtggaga 

ggctttaccg 



gtgtgttcgt 

aggagcgaga 

cgtacacgct 

tggtggcgat 

cgggaggggt 

cgctgtcgcc 

gcgagggctg 

gcatctgggc 

ccgcgcccaa 

tcgaagctgg 

ccatcgaggt 

gcgtgctggg 

gcctgatcaa 

gcacgctcaa 

tgccgtggcc 

gaacgaacgc 

cggagcgctc 

aggcggcgcg 

cgttcagcct 

cgcgcgaggg 

cggcgcgctg 

gcgcgcagac 

cgttcgaccg 

tgtgggcgga 

ccgcgctctt 

cggagctcct 

tctcgctgga 

cggcgggcgg 

cgccgcacgc 

cgggcgtgga 

ccaagcggct 

agttcgggcg 

acctgagcgg 

tgcgggaggc 

cgttcgtcga 

aggcggagcc 

tcgaggcgct 

ccacggctgg 

tcgaggcgcc 

accgggtgga 

gcgggtggct 

cgtcgcaggg 

agcaggtgac 

gtctggacgc 

ttgccgcggc 

ggctctggats 

gtcaggcggc 

gcgggctcgt 

agctgctttc 

cgcggcttgt 

ggagttactt 

tggagcgcgg 

aatggggccg 

aggcgcaggg 

cggcgctctt 

tcgacgacgg 

aggtggaggg 

tactgttttc 

gcaatgcctt 

gcatcgcctg 

acgaggcatc 

tgctcggtac 

cggcgccgcg 

aggcctcctc 

cccgctcggc 

accagggcac 



22200 

22260 

22320 

22380 

22440 

22500 

22560 

22620 

22680 

22740 

22800 

22860 

22920 

22980 

23040 

23100 

23160 

23220 

23280 

23340 

23400 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 

24900 

24960 

25020 

25080 

25140 

25200 

25260 

25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 



4-30582A 





8 



□ 



gctcgacgtg 
ccgcaaacgg 
tccgaccgtg 
caccgacgtg 
cgcctgccgc 
gggcgtggtg 
ccccggctcg 
ggtggagacg 
cccgcaacag 
cccgtcggcg 
tgccgagcgg 
catgctcagc 
ggctgtggat 
gcgacggggc 
gaccttcgcg 
ctcggccgac 
gctctccgac 
caatcatgat 
gttacgccag 
ccacgggacc 
cgggcaagcc 
gcacatggag 
agagcaaata 
gctgccggtg 
cgcgggggtg 
gccggcggtg 
gggcaagagc 
gcacccggag 
gaaccaccgg 
cgtggcgcag 
caagctggcg 
ttgcgcggcg 
ggagctggac 
gttgctcgac 
ggcgctgtgg 
gctggtggcg 
ggcgcgcggg 
gccggaggcg 
ggtcaatggg 
ggcggggttc 
ctcgccgctg 
ccggcggcca 
gagcgcgccg 
gaaggcgctg 
gctcgggctg 
cgccgggcgc 
cggctcggtc 
ctatccgtgg 
agccgcggat 
cctccagaaa 
agtcggcgag 
tgcgccggca 
cgattggcag 
gatcgatgag 
gtttctgagc 
cgttggcgac 
ggcggcgctc 
cccgccccaa 
gaccgaggat 
cccgccacag 
gggaggcctc 
gcacttggtg 
gccgcctgag 
ggtgaccgtg 
ggtcgagccc 
ggcggagacg 



cgacgaggct 
cttcagggtg 
gagcggctgg 
cggagcgttc 
ttcccgggcg 
gtcagcaccg 
ggagaggcac 
ttcgatgcgg 
cggctgctgc 
ctgcgcgaga 
gtgcaggaac 
gttgcggcgg 
acggcgtgct 
gagtgcgacc 
ctgctctcac 
gcggacggct 
gcgcagcgcg 
ggcccgagca 
gcgctggcgc 
gggacggcgc 
cgccctgcgg 
cccgcggcgg 
ccagcccagc 
gcggtggccc 
agctcgttcg 
gagctgtggc 
gagggggcgc 
ctcgggctcg 
ctcgcggtgg 
gggcagacgc 
ttcctgttca 
tggccagcgt 
cgcccgctgc 
cagacggcgt 
cggtcgtggg 
gcgtgcgtgg 
cggctgatgc 
gaggtggcgg 
ccggagcagg 
gcggcgcgcg 
atggaaccga 
agcgtttcgc 
gggtactggg 
cacgaagccg 
ttgccagcct 
gaggaggctg 
agctggccgg 
cagcggcagc 
ccgacccaag 
tcagaggagg 
gcggtcgctg 
gagacatccg 
gtagtgctct 
atcggcgacg 
accgtgtctt 
gagcctgcga 
gagcatcccg 
gccagcccga 
cagctcgcct 
gggcaagcgg 
ggtgggctgg 
ctgaccagcc 
atccgcgcgc 
gcagcggtgg 
ccgctgcgag 
gacgagaccc 



tcgccgagca 
agctgggtat 
tggaatactt 
ggttgccggc 
gggtcgagga 
aggtgccggc 
agagacagac 
cgttcttcca 
tggaagtgag 
gccccacggg 
tcgccgatga 
gacggctatc 
cctcgtcgct 
aagccctggt 
ggatgcacgc 
acgcgcgggc 
accgcgaccc 
gcgggctgac 
acgcaggggt 
tgggcgaccc 
accgaccgct 
gcctggccgg 
cggagctggg 
gcgcagcggt 
ggatgagcgg 
ctgccgcgcc 
tcgatgcgca 
gggacgtggc 
cggtgacgtc 
cgccgggggc 
ccggacaggg 
tccgggaggc 
gcgaggtgat 
tcacccagcc 
gcgtagagcc 
cgggggtgtt 
aggggctctc 
cggcggtggc 
tggtgatcgc 
gcgcgcgcac 
tgctggagga 
tggtgagcaa 
tgcggcacgt 
gtgcgggcac 
gcctgccgga 
cgggggtgct 
gcgtcttccc 
ggtactggcc 
gctggttcta 
cgagccgcgg 
cagcgctgtc 
cgaccgccga 
acctgtgggg 
cgacccgtcg 
gttcgccccg 
tcgccccttg 
gggcctgggg 
tcgacggcga 
tccgccatgg 
caccggtgtc 
gcctgatcgt 
ggcgcgggtt 
ggatcgcagc 
acgtggccga 
gggtggtgca 
tgctcgagtc 



gggcctcgac 
gccgctgtcg 
gctgagccag 
gacagaggac 
cctggagtcc 
cgaccggtgg 
ctacgtgccc 
catctcgcct 
ctgggaggcg 
cgtgttcgtg 
ggcggcgggg 
atttttcctg 
ggtggcgctg 
tggcggggtc 
actttcgccc 
cgagggctgc 
catcctggcg 
agtgcccagc 
ggttccggcc 
gatcgaggtg 
gatcctggga 
cttgctcaag 
cgagctcaac 
gccgtggccg 
aacgaacgcg 
ggagcgctcg 
ggcggcgcgg 
gttcagcctg 
gcgcgagggg 
ggcgcgctgc 
cgcgcagacg 
gttcgaccgg 
gtgggcggag 
cgcgctcttc 
ggagctggtg 
ctcgctggaa 
ggcgggcggc 
gccgcacgcg 
gggcgtggag 
caagcggctg 
gttcgggcgg 
cctgagcggg 
gcgggaggcg 
gttcgtcgaa 
ggcggagccg 
cgaggcgctg 
cacggctggg 
cgacatcgag 
tcgcgtggac 
gagctggctg 
gacacgtgga 
gctggtgacc 
tctggacgcc 
tgctaccgcg 
actctgggtc 
tcaggcggcg 
cgggctcgtg 
gatgctcgtc 
gcgccggcac 
gctgtctgcg 
ggcccagtgg 
gcccgaccgg 
ggtcgaggcg 
cgtcgaaccg 
cgccgctggc 
ggtgctccgt 



tccctgatgg 
gcgacgctag 
gcgctggagc 
ccgatcgcca 
tactggcagc 
aatggggcag 
aggggtggct 
cgggaggcga 
atcgagcgcg 
ggcgcgggcc 
ctctacagcg 
ggcctgcacg 
cacctcggct 
aacatgctgc 
ggcgggcggt 
gccgtggtgg 
gtgatccggg 
ggccctgccc 
gacgtcgatt 
cgtgcgctga 
gccgccaagg 
gcggtgctcg 
ccgctcttgc 
cgcacggacc 
catgtggtgc 
gcggagcttt 
ctgcgcgagc 
gcgacgacgc 
ctgctggcgg 
atcgcgagct 
ccgggcatgg 
tgcgtggcgc 
ccggggagcg 
acggtggagt 
gctgggcata 
gatggggtga 
gcgatggtgt 
gcgtcggtgt 
caagcggtgc 
catgtctcgc 
gtggcggcgt 
aaggtggtcg 
gtgcgcttcg 
gtgggcccga 
acgctgctgg 
ggcaggctgt 
cggcgggtgc 
cctgacagcc 
tggccggaga 
gtattggcgg 
cttccatgcg 
gaggctgccg 
gtcgtcggtg 
ccggtgctcg 
gtgacccggg 
ttatggggca 
gacctggatc 
accgagctat 
gcggcacggc 
gaggcgagct 
ctggtggagc 
caggcgtggt 
ctggaggcgc 
atgacagcgc 
gtcagcgtca 
cccaaggtgg 



ccgtggagat 
cgttcgacca 
tgcaggaccg 
tcgtgggtgc 
tgttgaccga 
acgggcgcgt 
ttctgcgcga 
tgagcctgga 
cgggccagga 
ccaacgaata 
gcaccggcaa 
ggccgaccct 
gccagagctt 
tctcgccgaa 
gcaagacgtt 
tgctcaagcg 
gtacggcgat 
aggaggcgct 
tcgtggaatg 
gcgacgtgta 
ccaaccttgg 
cgctggggca 
cgtgggaggc 
gcccgcgctt 
tggaagaggc 
tggtgctgtc 
acctggacat 
gcagcgcgat 
cgctttcggc 
cgtcgcgcgg 
gccgggggct 
tgttcgaccg 
ccgagtcgtt 
acgcgctgac 
gcgccgggga 
ggapcgtggc 
cgctcggagc 
cgatcgcggc 
aggcgatcgc 
acgcgtccca 
cggtgacgta 
cggacgagct 
cggacggggt 
agccgacgct 
cgtcgttgcg 
gggccgccgg 
cgctgccgac 
gtcgccacgc 
tacctcgcag 
ataagggtgg 
tcgtgctcca 
gcggtcgaag 
cggaggcgtc 
gcttggctcg 
gggcatgcat 
tgggccgggt 
cccgagcgag 
tgtcgcagga 
tggtggccgc 
acctggtgac 
tgggagcgcg 
gcgagcagca 
ggggtgcacg 
tggtttcgtc 
tgcgtccact 
ccgggagctg 



26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
27600 
27660 
27720 
27780 
27840 
27900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 



4-30582A 



-9- 



gctgctgcac 
cgcagcggtg 
cgggctcgcg 
gtgggccgag 
tctgcccatg 
ggctcagcgc 
agggcgtcgc 
tccggcggca 
gcacgagatc 
cgatcctggg 
caacctcctt 
gacggtacag 
cgacacccag 
ctgccgcttc 
cgtggtggtc 
tgatccggag 
gagattggat 
gcagcggttg 
tacgctgcga 
gcagcggctg 
gctcagcgtt 
catggatacg 
actgggcgag 
cttcgtgctg 
ggccgacgcg 
gcgcgatgcg 
ccacgacggc 
gcgccaggcg 
cgggacaggg 
tccagggcgc 
tctggaggcg 
gcagatcccg 
gccggtggcg 
cggcgtgagc 
ggaggtggag 
caagagcgcg 
cccggagctg 
gcaccggctc 
ggcgcagcaa 
gctggctttc 
cgaaacgtgg 
gatcgaccag 
gctcgatcag 
cctgtggcgt 
ggtcgccgcc 
gcgcgggcgg 
cgaggccgag 
caacggtcct 
gacgttcgcg 
gccgctcatg 
cgcgccagac 
cacgcccgag 
ggcgttgcat 
cgggctgttg 
ggaccgctcg 
tgcgctcgac 
tccatggcag 
gatcgcaggt 
ccacgtgctc 
caaggtggtg 
ctggcccgag 
gcccgaccag 
cctgttcgag 
ccgcggtcgg 

ggaggaccgc 

gcggatcggc 



cggctgctgc 
tggggtagcc 
catcttcggc 
ggaggcatgg 
tcgacgtcgg 
acggtgaccc 
aacctgcttt 
gcaacccgga 
gtccatgggg 
atggggttca 
caggctgagc 
cggctggtgg 
catgttcggt 
ccgggcgggg 
agcgccgagg 
atcccaggcc 
gcgaccttct 
ctcctggagg 
gatagcccca 
cgaggcttca 
acggctggac 
gcgtgctcgt 
tgcgatcaag 
ctctcacgga 
gacggctacg 
cagcgcgccg 
ccgagcagcg 
ctttcgcaag 
acggcgctgg 
tccggggacc 
gcatctggct 
gcccagccgg 
gtgccacgta 
gcgttcgggt 
ccggcgcccg 
gcggcgctgg 
agcctcggcg 
gccatcgcga 
aagacgccgc 
ctgttcaccg 
cctgcgttcc 
cctctgcgcg 
accgcgtacg 
tcgtggggcg 
tgcgtggcgg 
ctgatgcagg 
gtggccgcct 
gacgccgtcg 
gcgcgtggga 
gatccgatgc 
cgcccggtgg 
tattgggtcc 
gccgcgggtg 
ccagcgtgcc 
gaatgcgagg 
tggaagggcg 
cgtgagcgcc 
cgctggccgc 
tcgatcggac 
gtgcccggcg 
cgggcgatcg 
gaggtcgagc 
ctggcgaccc 
gtgcagccga 
gcgatccagc 
tggggtccgc 



acggccggcc 
atagccaggg 
gttcgcaatc 
cggacgcgga 
cagcgttgtc 
ggatggactg 
cggcgctggt 
actggcgtgg 
ccgtcgctcg 
atgagcaggg 
tggacgtgcg 
agcatctgct 
cgttggcgtc 
tggaggacct 
tgccggccga 
ggacttacgt 
tccgcatctc 
taagctggga 
ccggggtgtt 
ccgacggagc 
ggctgtcgtt 
catccctggt 
cgctggttgg 
tgcgcgcgct 
cgcggggcga 
gcgactccat 
ggctgaccgt 
caggcgtgtc 
gcgacccgat 
gaccgctggt 
tggccagcct 
agctggggga 
aggcggtgcc 
tgagcggaac 
cggcgccggc 
acgccgcggc 
acgtggcgtt 
cgacctcgcg 
agggcgcggt 
gacagggcgc 
gggaggcgtt 
aggtgatgtg 
cgcagccggc 
tggagccgca 
gcgtgttctc 
cgctacccgc 
ccgtggcgcc 
tgatcgccgg 
tacgcacgaa 
tggaagactt 
tgtcgaatgt 
ggcatgtgcg 
ccgccacgtt 
tcggggaagc 
tggtcctcgc 
tgttccccga 
attggatgga 
tggctggtgt 
cacgccatca 
cctttcatgt 
agctgacagg 
tccacgccgt 
tggcggcgcc 
cagacggcgc 
ccctcgactt 
tttggcgatg 



tctcgacctg 
tgcgtacgcg 
gctgcctgcg 
ggctcatgca 
ggcgctccag 
ggcgcgcttc 
cgcagggcgc 
cctgtccgtt 
ggtgctgggc 
cctcgactcg 
gctttcgacg 
cgtcgatgta 
agacgagccc 
ggagtcctac 
ccggtgggat 
gaccaaaggc 
gcctcgcgag 
agcgctcgag 
cgtgggtgcg 
ggcagggttg 
tttcctgggt 
cgcgctgcac 
cggggtcaac 
ttcgcccgac 
ggggtgcgcc 

cctggcgctg 
acccaacgga 
tccggtcgac 
cgaggtgcag 
gctgggggcc 
gctcaaggcc 
gctcaacccg 
gtgggggcgc 
caacgtgcat 
gcgaccggtg 
ggcacggctc 
cagcctggcg 
cgaggccctg 
gcgcggcaag 
gcaaatgccg 
cgaccggtgc 
ggctgcgccg 
tctctttgcg 
cgtactgctc 
gctcgaagat 
cggcggtgcc 
ccacgccgcc 
cgccgaggta 
gaggctcgcc 
ccagcgggtc 
caccggccac 
aagcgccgtg 
cgtcgaggtt 
ggacgcggtc 
ggcgctcggg 
tggcgcgcgc 
cctcaccccg 
cgggctctgc 
gcccttcctc 
cgcggtgatc 
cgtggagttc 
gctcaccccc 
ggagaccgaa 
gcccggcgcg 
cgccggattc 
gctgcaggac 



ttcgtgctgt 
gcggccaacg 
ttgagcgtcg 
cgtctgagcg 
cgcctggtgg 
gcgccggtgt 
gacatcatcg 
gcggaagccc 
ttcctcgacc 
ttgatggcgg 
acgctggcct 
ctgaagctgg 
atcgccatcg 
tggcagctat 
gcggcggact 
gccttcctgc 
gcgatgagcc 
agcgcgggta 
gggcccaatg 
tacggcggca 
ctgcacggcc 
ctcgcctgcc 
gtgctgctcg 
gggcggtgca 
gtggtggtgc 
atccggggaa 
cccgcccagc 
gttgattttg 
gcgctgagcg 
gccaaggcca 
gtgcttgcgc 
cacttgccgt 
ggcgcacgcc 
gtcgtgctgg 
gagctggtcg 
tcggcgcacc 
acgacgcgca 
cgaggcgcgc 
gccgtgtcct 
ggcatgggcc 
gtggcgctct 
ggcctcgctc 
ctggagtacg 
ggtcatagca 
gcggtgaggt 
atggtagcca 
acggtgtcga 
caggtgctcg 
gtctcccatg 
gctgcgacga 
gtcgcaggcc 
cgcttcggcg 
ggcccgaagc 
ctcgtgccgt 
gcttggtatg 
cgcgtggctc 
cgaagcgccg 
atgcccggcg 
ggtgatcacc 
ctcagcatcg 
ctgaaggcca 
gaagccgccg 
cgccgatgga 
ttgccgcgcc 
ctcgacaggt 
gggcgcgtcg 



tctcgtcggg 
ctttcctcga 
cgtggggtct 
acatcggggt 
agaccggcgc 
acaccgctcg 
cgccttcccc 
gcgtggctct 
cgagcgcgct 
tggagatccg 
ttgatcatcc 
aggatcgcag 
tgggagccgc 
tggccgaggg 
ggtacgaccc 
gcgatttgca 
tcgacccgca 
tcgctccgga 
agtactacac 
ccgggaacat 
cgacgctggc 
agagcctgcg 
cgccggagac 
agacgttctc 
tcaagcggct 
gcgcggtgaa 
aagcattgct 
tggagtgtca 
aggtgtatgg 
acgtcgcgca 
tgcggcacga 
ggaacacgct 
cgcgtcgggc 
aggaggcacc 
tgctatcggc 
tgtccgcgca 
gcccgatgga 
tggacgccgc 
cacgcggtaa 
gtgggctgta 
tcgatcggga 
aggcggcgcg 
cgc^ggctgc 
tcggcgagct 
tggtggccgc 
tcgcagcgto 
tcgccgcggt 
ccctcggcgc 
cgttccactc 
tcgcgtaccg 
ccgagatcgc 
acggggcaaa 
cggtcctgct 
cgctacgcgc 
cctggggggg 
tgcccatgta 
cgcctgcagg 
ctgtgttgca 
tcgtgtttgg 
ccgccgagcg 
tcgcgatgga 
gggatggcta 
cgacccacgc 
tcgaggtgct 
tatcggcggt 
gcgacgaggc 



30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
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ctcgcttgcc 
gatcctgctg 
cgacgggacg 
tggaagggtg 
gctggtcgac 
gccgcgagag 
cgactggccc 
cgtggtggca 
cgtcctcgcc 
gatctgcctc 
gaccgagggc 
gaccatgggc 
atggggcctc 
tttggagccg 
cgctgacgac 
caaagcgacg 
tgggcagaag 
cccgggcgag 
tgtgctggga 
cacggcggtg 
gacgttgcat 
gactcccgcg 
cgacctgggg 
gggcatggcc 
cccgtccaag 
gcggacgctg 
ggtgctcaac 
cgggcggttc 
gcatcccggt 
agagatcctc 
gcatgcgttc 
tcagggcaag 
cgtactgctg 
gcagggcgtg 
caaagccgtc 
cgccgatcgg 
gggcgtgatc 
ccgcttctcg 
ggcgggcaac 
ggccgggcag 
gcgggccgaa 
ggcagcgggg 
gtcgccggct 
cggggcgatg 
tgtgtggcgc 
attggccgcg 
cgtgcaggcc 
tcggccgctg 
cggccagcgg 
cgcgctcacg 
cgcaaagtcg 
ccgtttccca 
cgatgccgtc 
tccggatgtg 
gttcgatccg 
gcggctgctc 
gctgatgggc 
cgccggcggc 
ctcgggcagg 
gtgctcctcg 
ttcggtggcg 
cagccggctg 
cggcgtgggg 
gcgcgatggg 
cagcaacggg 
ggagcaggcg 



accctcgtgc 
gacaacggct 
cccccgctgc 
cggtgtggcg 
gaaactggcg 
gtgttcctgc 
gaagcgccct 
gcacctggct 
gaacccaaag 
tgggaggctg 
ctctcggtgg 
gcagtggccg 
ggccggacag 
gaggccgatg 
gagacacagg 
acccccgaag 
ggcacattgg 
gtcgagatca 
atgtatccgg 
ggccaggggg 
cgattcgtca 
caggcagcta 
aatctgcggc 
gcggtgcaaa 
tgggcagcgg 
gagtttgctg 
gcgctggccg 
ctcgagatgg 
gttcgctatc 
gagcgcgtgg 
gcgatcacca 
gtcgtgctgc 
accggtgggc 
ccgcacatgg 
gcggagatcg 
aatgcgctgg 
catgcagccg 
cgggtgctgg 
gatctcgctt 
tccaactatg 
ggcctggcgg 
ctcagcgcgg 
cagggcaccg 
tcgctcgaqg 
gcgttggtgc 
cgtcttgggg 
gagatcgcgc 
tcggacttgg 
gtgggtgcga 
cgctggctgc 
tcgccgcagg 
ggcggcgtgg 
gtcgaggtgc 
cgcggcaaga 
gccttcttcg 
ctggagacga 
agcgataccg 
atcgaggcgt 
atctcttatg 
tcgctggtcg 
ctggccggcg 
cgaggcctgg 
tggagcgaag 
gatccgatcc 
ctgacggcgc 
gggctggctc 



cgacctatcc 
ttgcggtgag 
cgttcgccgt 
gcgtgccgcg 
aggtggtcgc 
ggcaggagtc 
tgcccgatgc 
eggagatggc 
gcctcgaggc 
gagcccacga 
tgcaggcgct 
tcgaggccgg 
tgatgcagga 
cagcgcgctc 
tggctttccg 
ggctcctggt 
accagctccg 
aggtaaccgc 
gcgacgccgg 
tgcgccacgt 
cggtcgacgc 
cggtgccggt 
gcggcgagcg 
tcgcccgatg 
ttcaggccat 
agacgttccg 
gcgagttcgt 
gcaagaccga 
gggtattcga 
tcgagggctt 
aggccgaggc 
tgccggcgcc 
tgggagcgtt 
tgctcacagg 
aagcgctcgg 
aggctgtgct 
gagcgctcga 
caccgaaggt 
tcttcgtgct 
cggcggccaa 
cgcagagcct 
cgctgcaggc 
cgctgctcgg 
tgcgtgcggc 
gcgcggaggc 
cgctgcccga 
gcgtgctttc 
gcctcgactc 
cgctgccggc 
tcgataaggt 
tcgccctcga 
ccgatccgga 
cgcatgagcg 
tgacgacacg 
gcatctcgcc 
gctgggaggc 
gcgtgttcgt 
tcgatggcta 
tgctcgggct 
cggtgcacct 
gcgtggcgct 
ctcccgacgg 
gctgcgccat 
tggcggtgat 
ccaacgggtc 
cggcggacgt 



gaacgcccac 
cctgctgtca 
ggaacgggtg 
gtcgcaggca 
cgaggtggag 
gggcgcgtcg 
gcctgcggaa 
cgcggcgctc 
ggccctcgcg 
ggaagctccg 
cagggaccgc 
tgagcgggtg 
gcgcccggag 
agctgacgtt 
ttccggaaag 
ccctgacgca 
cctcgcgccg 
ctcggggctc 
gccgatgggc 
cgcggtcggc 
gcggctggtg 
cgcgttcctg 
ggtgctgatc 
gataggggcc 
gggcgtgccg 
gcaggtcacc 
ggacgcgagc 
catacgggat 
catcctggag 
tgctgcggga 
agcgtttcgg 
ctccgcagcg 
ggggctccac 
tcggcggggc 
cgctcgggtg 
ccaggccatt 
tgatggtgtg 
gactggcgcc 
gttctcctcc 
caccttcctc 
cgcgtggggc 
gcggctcgct 
gcaggcgctg 
aagccaagct 
gcgccatacg 
ggcgcgtcgc 
atggagcgcc 
gctcacggcg 
gacgctggca 
cctggccgtg 
cgagcccatt 
gtcgttttgg 
atgggacatc 
ctttggcggc 
gcgcgaagcg 
gttcgagcgc 

ggggctcttc 

tctaggcacc 
aaaggggccg 
ggcctgccag 
gatgctcacg 
acggtgcaag 
gctcctgctc 
ccgcggcacc 
gtcgcagcaa 
cagctacgtc 



gacgtggcgc 
acccggagcg 
cggtggtggc 
ttcggtgtct 
ggatttgttt 
actgcagcct 
cggatcgagg 
gcaacacggc 
ggggtgtctc 
gcggcggcgc 
gcggtgcgcc 
caggtcgcca 
ctcagctgca 
ctgttgcggg 
cgccgcgtag 
gagtcctatc 
gcacagcgcc 
aacttccgga 
ggagattgtg 
gatgctgtca 
gtccggcagc 
acggcctggc 
catgctgcgg 
gaggtgttcg 
cgcacgcaca 
ggcggccggg 
ctgtccctgc 
cgagccgcgg 
ctcgctccgg 
catctgcgcg 
ttcatggcgc 
cccttggcgc 
gtggcccgct 
ctggatacgc 
acgatcgcgg 
ccggcggagt 
cttgatgagc 
tggaatctgc 
atgtcggggc 
gacgcgctgg 
ccatggtcgg 
cggcatggga 
gctcggccgg 
tcgggagcgg 
gcggctgggg 
gccgacgagg 
gcgagcgccg 
gtggagctgc 
ttcgatcacc 
gccgagccga 
gccatcatcg 
cggctgctcg 
gacgcgttct 
ttcctgtccg 
acgaccatgg 
gccgggattt 
taccaggagt 
ggcaccacgg 
agcctgacgg 
gcgctgcggc 
ccggcgacgt 
agcttctcgg 
aaaccgcttc 
gcggtgaacc 
gaggtgatcc 
gagtgccacg 



ccttgcaccc 
agccggagga 
gggcgccggt 

cgagcttcgt 
gccgccgggc 
tgtaccgcct 
agagctgggt 
tcaaccgctg 
ccgcaggtgt 
agcgtgtggc 
tgtggtgggt 
cagcgccggt 
ctctggtgga 
agctcggtcg 
cgcggctggt 
gactggaggc 
gggcacctgg 
ccgtcctcgc 
ccggtgtcgc 
tgacgctggg 
ctgcagggct 
tcgctctgca 
ccggcggtgt 
ccacggcgag 
tcgccagctc 
gcgtggacgt 
tgtcgacggg 
tcgcggcggc 
atcgaactcg 
cattgccggt 
aagcgcggca 
cgacgggcac 
ggctcgccca 
cgggcgctgc 
cgtcggatgt 
ggccgttaca 
agaccaccga 
atgagctcac 
tcttgggctc 
ccgcgcatcg 
acggaggcat 
tgggagctct 
aaacgcagct 
cagtgccgcc 
cgcagggggc 
tgcgcaaggt? 
tgcccgtcga 
gcaacgtgct 
cgacggtcga 
gcgtatcgtc 
gca^cggctg 
aaqagggcag 
atgatccgga 
atatcgaccg 
atccgcagca 
tgcccgagcg 
acgctgcgct 
ccagcgtcgc 
tggacaccgc 
ggggcgagtg 
tcgtggagtt 
ccgcagccga 
gcgatgcgca 
aggatgggcg 
gtcgggccct 
gcaccggcac 



34080 
34140 
34200 
34260 
34320 
34380 
34440 
34500 
34560 
34620 
34680 
34740 
34800 
34860 
34920 
34980 
35040 
35100 
35160 
35220 
35280 
35340 
35400 
35460 
35520 
35580 
35640 
35700 
35760 
35820 
35880 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
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gacgttgggc 
ctcggaccgg 
ggcgggcgtg 
gagcctgcat 
ggccgccaaa 
gtttggcgtc 
cgcgcccgcg 
gctggacgcg 
cggcgacctg 
ggcggcgacc 
gccgcccgca 
ctttcctggc 
cgtcttccgc 
gctgctcgcc 
gcagccggcg 
cgagccggat 
cgccctgtcg 
gatcagcggc 
gctcctgggc 
gctggcgggc 
gttctgccgt 
cgacgagcta 
ctcgacggtg 
caacgttcga 
tgggctgttc 
acgggcgacg 
cctgtccatg 
gcggctgttc 
gcagcgcgag 
tgctcatgcg 
cacgcgcgtg 
ggtgcagggg 
ggccgaggcc 
ggccttcgcg 
ccgcctgcaa 
ccatgcccgc 
ggccgcgctt 
ggccgagatg 
ggagggcgag 
ccggctccac 
cggcgaggcg 
gtcgggggag 
gcggagtacc 
gctcgtggcg 
ggagccggct 
gctcatcggc 
ccattccgtc 
gacggcgtcc 
gcgtggcgtg 
gcgcggctgc 
tcctccgcgg 
tgtggcgcaa 
gcgctgcgct 
tgccgagctg 
cgtggcccgg 
ggaaggccgg 
ccgagccacg 
ggggctcaac 
tccggttgcg 
gagccttcgt 
caccatcgac 
cgcgctgccc 
ggccggcgag 
gatcgcccgc 
gtggctgcgc 
gcaagtgctg 



gaccccatcg 
ccgctcgtga 
gccggtgtca 
ttcgacgcgc 
cccgtcgaat 
agcgggacca 
gcggcgcgtt 
caggcggcgc 
gcgttcagcc 
tcgcgcgagg 
gcggctcgcg 
cagggctccc 
gacgcgctct 
gagctcgcgg 
ctgttcgcga 
gcagtggtag 
ctcgaggatg 
caaggcgaga 
tacgaagacc 
gagccggcag 
cgagtcaagg 
ttggcagcat 
acgagcacga 
cagccggtgc 
gtggagatga 
aagcgggagg 
ttggaggcgc 
tccgcgggcg 
cggtactggg 
ggcagtcacc 
tgggagacga 
gcggtcgtgt 
ttgggtgacg 
gatgatacgc 
ttccacgttg 
ggggtgctgc 
cgtgcccggc 
gggctcgagt 
gcgctgggac 
cccgcgctct 
acgccatggg 
ctgtggtgtc 
gacttctggg 
cagcggctcg 
tgggaaccga 
tcgggcggcg 
gtccacgcga 
ttcgacggcc 
ctcgacgcgg 
gacagcgtgc 
ttgtggctcg 
gcgccgctcc 
cggatcgacc 
ttggccgacg 
ctcgtccgaa 
ccgttccggc 
gagcggcgcc 
tttctcgacg 
ctgggcgccg 
atcggccagg 
gcccggatgc 
gtcgcattca 
cgcgtgctca 
cacctcggcg 
gagcagggga 
gccgcgacga 



aagtgcaggc 
tcgggtcggt 
tcaaggtggc 
ccaatccgca 
ggacgagaaa 
acgcgcacgt 
cagcggagct 
ggctttcggc 
tggcgacgac 
cgctgtctgc 
gccacgcttc 
agtggctggg 
cggcgtgtga 
ccgatgagac 
tcgaggtcgc 
gccacagcat 
ctgtagcgat 
tggcggtcgt 
ggctcagcgt 
cgctcgcaga 
tggacgtcgc 
tgggcgagct 
tcatggcggg 
gcttcgccga 
gcccgcatcc 
gagtcgcggt 
tgggagcgct 
gcgcgggcct 
tcgatgcgcc 
cgctcctggg 
cgctggatct 
tcccgggcgc 
gtccgctcca 
cggcggcggt 
cgagccgggt 
gccagatcga 
ttcaggccag 
acggcccagc 
gtgtgcggct 
tggatgcgtg 
tacccgtgga 
atgcgcggag 
tggtcgacag 
cgggaggtgt 
ccgcggtccc 
ggctcggcgc 
cagggcgcgg 
aggccccgac 
atgccccctt 
tctggaccgt 
tgacacgcgg 
tggggctggg 
tcgatccagc 
acgccgagga 
ggctgcccga 
tggagatcga 
ctcctggccc 
tgatgagggc 
agtgctccgg 
acgtcgtggc 
tcgcacctcg 
tgacggcctg 
tccactcggc 
cggagatatt 
tcgcgcacgt 
agggcgaggg 



cctgggcgcc 
gaagtccaat 
gctggcgctc 
cattccgtgg 
cggcgtgccg 
ggtgctggag 
tttcgtgctg 
gcacgtcgtt 
ccgcagcccg 
cgcgctcgac 
cacaggcagc 
catgggccaa 
ccgagcgatt 
cacctcgcag 
gctgtcggcg 
gggcgaagtg 
catctgccgg 
cgagctttcc 
ggcggtgagc 
ggtgctggcg 
cagccacagc 
cgagccgcga 
cccggagctc 
agcggtgcaa 
gatcctgacg 
gggctcgttg 
ctgggtacac 
ccgtcgcgtg 
gaccggcggc 
tgaaatgcag 
caaacggctg 
ggcgtacctg 
ggtcagcgat 
gcaggtcatg 
gccgggccac 
gcgcgccgag 
cgcacccgct 
gttccagggg 
ccccgaggcc 
cttccacgtg 
aatcggctcg 
tgtgagccac 
cacgggcgcg 
acgccggcgc 
cggatccgag 
tgcgctccac 
cacgagcgcc 
gtcggtggtg 
cgacgccgat 
gcaggccgtg 
cgctcaggcc 
ccgcgttatc 
gcggcgcgac 
ggaagtcgcg 
gaccgactgc 
tgggtccggc 
gggcgaggtc 
catggggatc 
ccgaattgtc 
cgtcgcgccc 
ccccgcggcg 
gtacggtctc 
gacggggggc 
tgcgaccgct 
gatggactcg 
ggtcgacgtc 



gtgctggcac 
atcggacata 
gagcgcgggc 
tcggagctcg 
cgacgagccg 
gaggcgccag 
tcggcgaaga 
gcgcacccgg 
atgacgtacc 
acagcggcgc 
gccccaaagg 
aagctcctct 
caggccgaag 
ctcggccgca 
ctgtggcggt 
gcggccgcgc 
cgcagcctgc 
ctggccgagg 
aacagcccgc 
atccttgcgg 
ccacagatcg 
caagcgaccg 
gtggcgagct 
tcgttgatgg 
acatcggtcg 
cggcgtggac 
ggccaggcgg 
ccgctgccga 
gcggcgggcg 
accctgtcga 
ccgtggctcg 
gagatggcgc 
gtggtgctcg 
gcgaccgagg 
ggcggtgctg 
gtcccggcga 
gcggctacct 
cttgtcgagc 
gccggctccc 
agcagcgcct 
ctgcggtggt 
ggaaagccaa 
atcgtcgccg 
gaagaagacg 
gtcatggcgg 
tcggcgctga 
gccgggttgc 
cacctcggca 
gcgcttgagg 
gccggggcgg 
atcggcgccg 
gccttggagc 
ggagaagtcg 
tttcgcggcg 
cgagagaaaa 
gtgctcgacg 
gagatcgccg 
taccctgggc 
gcgatgggcg 
ttcagtttcg 
ctgacggccg 
gtccatctgg 
accgggctcg 
ggtacaccgg 
cggtcgctgg 
gtgttgaact 



aggggcgacc 
cgcaggctgc 
ttatcccgag 
ccgtgcaggt 
gggtgagctc 
cggcggcgtt 
gcgccgcggc 
agctcggcct 
ggctcgcggt 
aggggcaggc 
tggttttcgt 
cggaggagcc 
ccggctggtc 
tcgacgtggt 
cgtggggcgt 
acgtcgccgg 
tgctgcggcg 
ccgaggcagc 
gctcgacggt 
caaagggggt 
acccgctgcg 
tgtcgatgcg 
actgggcgga 
aagacggtca 
aggagatccg 
aggacgagcg 
tgggctggga 
cctatccctg 
gcagccgctt 
cccagaggag 
gcgatcaccg 
tttcgtccgg 
ccgaggcgct 
agogaccagg 
cctttcgaag 
ggctggatct 
atgcggcgct 
tgtggcgggg 
cagccgcgtg 
tcgctgaccg 
tccagcggcc 
cacccgaccg 
agatctccgg 
actggttcat 
gccggtggct 
cggaagctgg* 
aggcactctt 
gcctcgatga 
agtcgctggt 
gcttccgaga 
gcgacgtctc 
acgccgagct 
atgagctgct 
gtgagcggcg 
tcgagcccgc 
acctggtgct 
tcgaggcggc 
ccggggacgg 
aaggtgtcga 
gcaqccacgt 
cgqaggcagc 
ggaggctccg 
ctgctgtgca 
agaagcgggc 
acttcgccga 
cgctgtctgg 



38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
38580 
38640 
38700 
38760 
38820 
38880 
38940 
39000 
39060 
39120 
39180 
39240 
39300 
39360 
39420 
39480 
39540 
39600 
39660 
39720 
39780 
39840 
39900 
39960 
40020 
40080 
40140 
40200 
40260 
40320 
40380 
40440 
40500 
40560 
40620 
40680 
40740 
40800 
40860 
40920 
40980 
41040 
41100 
41160 
41220 
41280 
41340 
41400 
41460 
41520 
41580 
41640 
41700 
41760 
41820 
41880 
41940 
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cgccgcgatc 
caagacggac 
ctacagcgcc 
gctggcggag 
cttccccctc 
gaagctcgtg 
cgtcgccatc 
gagcgtggct 
cggcgcggtg 
tgtcacggta 
ggttaccgcg 
cgggctgctg 
aggggccttg 
cgcttcggga 
gttcctcgac 
ctggggcctg 
ggtcacccgc 
gctcgacggc 
gttctacccg 
ggcttccggt 
gggcgcgcgg 
cctctccgaa 
gatggggcta 
cctgctgtgg 
ctctacgggg 
ccacgaagtc 
gcgcgcggga 
gtgag[gttac 
agaccgagcc 
cggaggcgtt 
gctgggcgct 
ccgaggccat 
cgctcgaccc 
gcatcccgcc 
cggagtacct 
ccaccggcaa 
gaccttgcct 
gccgcagcct 
tctcccccga 
gccagacctt 
tgctcaagcg 
gatcggccat 
agggggcgct 
acatcgagac 
gcgctgtggt 
ccaacctcgg 
cgctacatca 
ggatcgaggg 
ggacgcgctt 
tggaggaggc 
tcgtcctgtc 
acctggagaa 
gcagcgcgat 
cgctttcggc 
gcggcagcgc 
tgggccgaaa 
gggccatcga 
cctcgcagct 
tttctgcgct 
gcgaggttgc 
tctgccggcg 
agctgtcgct 
cggtgagcaa 
tgctggcggc 
gccatagccc 
ggccgcgagc 



gacgcgagcc 
atctatgcag 
gtcgatcttg 
gtggtggacc 
tcgcgggccg 
ctcgcgctgg 
cgcgcggacg 
ggatggctgg 
agcgcggagc 
gcgagggcag 
tcggggatgc 
atgcagcaaa 
cacctgcatg 
gcagggctct 
gcactggcac 
ttcgcggacg 
gggacgcgga 
gatcgcaccc 
gcggcggcat 
cggctcgccg 
gcagggatgc 
ggcaagctcg 
gagctgcgca 
acctacccca 
gatggggaat 
gcttcgctcg 
aagaggtgat 
tctggccctt 
gatcgccatc 
ctgggagctg 
cgtaggtgtc 
cgacggcttc 
gcagcatcgc 
caggtccctc 
ccacgccgcc 
catgctcagc 
gaccgtcgat 
gcgcgctcga 
cacgatgcga 
cgacgcgtcg 
attgagcgac 
caatcaggac 
cttgcgcgag 
ccacggggcg 
ggggccggcg 
ccacctggag 
cgagcgcatc 
gaccgcgctc 
cgcgggagtg 
gccggcggtg 
ggcgaagagc 
gcacgtcgag 
ggagcaccgg 
cgcagcgcag 
gccgaaggtg 
gctcatggcc 
ggcggaagcg 
cgggcgcatc 
gtggcggtcg 
ggcggcgcac 
cagccggctg 
ggaggaggcc 
cagcccgcgc 
gctgacggcc 
gcaggtcgac 
ggctgcggtg 



tttcgaccct 
atcgctcgct 
cgggcttggc 
tgctcgcacg 
cggacgcgtt 
aggacccgga 
gcgcctacct 
ccgagcaggg 
agcagacggc 
acgtcgccga 
cgctccgcgg 
cccccgcgcg 
cgttgacacg 
tgggctcgcc 
accaccggag 
tgggtttggc 
gcctcacccc 
aggccggggt 
cttcgcggag 
gggatcggga 
tgcaggaggt 
acgtggatgc 
accgcatcga 
cggtggcagc 
ccgcgcgccc 
acgaagacgg 
tgcgtgacag 
cgcaagacgc 
gtggggatcg 
ctcgacgacg 
gacccaggcg 
gacgccgcgt 
ctgctgctgg 
gtcgggagcc 
gtcgcgcacc 
atcgccgccg 
acggcgtgct 
gagagcgatc 
gctctggcgc 
gccaacgggt 
gcgcggcggg 
ggccggtcga 
gcgctgcgga 
gcaacctcgc 
cgagccgacg 
ggcgctgccg 
ccgaggaacc 
gcgttggcga 
agctcgttcg 
gagcctgagg 
gcggcggcgc 
cttggcctcg 
ctggcggtgg 
gggcacacgc 
gtcttcgtgt 
gaagagccgg 
ggctggtcgc 
gacgtggttc 
tggggagtgg 
gtggccggcg 
ctgcggcgga 
gaggcggcgc 
tcgaccgtgc 
aagggggtgt 
ccgctgcgcg 
ccgatgcgct 



cgtgccggac 
ggggctcgct 
cgtgcgtcgg 

gggagcgctg 

ccggaaaatg 
cgtgcggatc 
cgtgaccggc 
ggctgggcat 
tgtcgccgcg 
tcgggcgcag 
cgtcgttcat 
gttccgcgcg 
cgaagcgccg 
gggccagggc 
ggcgcagggg 
cgccgggcag 
cgacgaaggg 
catgccgttc 
gttgtcgcgg 
cctgctcgaa 
cgtgcgcgcg 
gccgctcacg 
ggccgtgctc 
gctgagtgcg 
gccggataca 
gttgttcgcg 
accgagaagg 
tgaacgagcg 
gctgccgctt 
ggcgcgacgc 
acgacgtacc 
tcttcggtat 
aggtcgcctg 
gcaccggcgt 
agccgcgcga 
gacggctatc 
cgtcatcgct 
tcgcgctggc 
gcacccaggc 
tcgtccgtgg 
atggggaccg 
cggggttgac 
acgccggcgt 
tgggcgaccc 
gagcgcgctg 
gcgtggcggg 
tcaactttcg 
ccgaaccggt 
ggatgagcgg 
ccgcggcccc 
tggatgcgca 
gcgatgtggc 
ccgcgagctc 
cgccgggagc 
ttcccggtca 
tcttccgggc 
tgctcgggga 
agccggtgct 
agccggaagc 
cgctgtcgct 
tcagcggtca 
tgcgtggcca 
tcgccggcga 
tctggcggca 
aagagctgat 
cgacggtgac 



ggccgcttca 
cacttcagga 
cccgagcgcg 
cagccgcttc 
gcgcaagcgc 
cgcgttccgg 
ggtctggggg 
ctggtgctgg 
ctcgaggcgc 
atggagcgga 
gcggccggaa 
gtcatggcgc 
ctctccttct 
aactacgccg 
ctgccagcat 
caaaatcgcg 
ctgtgggcgc 
gacgtgcggc 
ctcatgacgg 
cggctcgcca 
caggtctcgc 
agcctgggaa 
ggcatcacca 
catctggctt 
gggagcgtgg 
ttgattgatg 
ccagctcctg 
cgataccctg 
ccccggcgga 
gatccggccg 
gcgctgggcg 
cgccccccgg 
ggaggggttc 
gttcgtcggc 
agagcgggac 
gtacacgctg 
ggtggccatt 
gggaggggtc 
gctgtcgccc 
ggagggctgc 
gatctgggcg 
ggcgcccaac 
cgaggccgag 
catcgagatc 
cgtgctgggc 
cctgatcaag 
tacgctcaat 
gccctggccg 
gaccaacgcg 
cgagcgcgca 
ggcagcccgg 
gttcagcctg 
gcgcgaggcg 
cgtgcgtggg 
gggctcgcag 
ggcgctggag 
gctctccgcc 
cttcgccatg 
ggtggtgggc 
cgaggacgcg 
gggggagatg 
tgagggtcgg 
gccggcggcg 
ggtgaaggtg 
cgcggcgctg 
gggcggggtg 



tcgagctcgg 
agagcctgtc 
tcgcagcgct 
cggtagagat 
agcatctcgg 
gcgaatccgg 
ggctcggtct 
tgggccgctc 
acggcgcgcg 
tcctccgcga 
tcctggacga 
ccaaggtccg 
tcgtgctgta 
cggccaacac 
tgagcatcga 
gcgcacggct 
tcgagcgcct 
agtgggtgga 
cacggcgcgt 
ccgccgaggc 
aggtgctgcg 
tggactcgct 
tgccggcgac 
ctcatgtcgt 
ctccaacgac 
agtcactcgc 
gagcgcttgc 
gagctcgaga 
gcgggcactc 
ctcgaggagc 
gggctgctca 
gaggcacggt 
gaagacgccg 
gtctgcgcca 
gcgtacagca 
gggctgcagg 
cacctcgcct 
aacatgcttc 
aatggccgtt 
ggtctgatcg 
ctgatccgag 
gtgqtcgccc 
gccatcggtt 
gaagcgctgc 
gcggtgaaga 
gcgacgcttta 
ccgcggatcc 
cggacgggcc 
catgtggtgt 
gcggagctgt 
ctgcgggacc 
gcgacgacgc 
ctgcgagggg 
cgggcctcgg 
tgggtgggca 
ggttgcgacc 
gacgaggccg 
gaagtagcgc 
cacagcatgg 
gtggcgatca 
gcgctggtcg 
ctgagcgtgg 
ctctcggagg 
gacgtcgcca 
ggagcgatcc 
atcgcgggtc 



42000 
42060 
42120 
42180 
42240 
42300 
42360 
42420 
42480 
42540 
42600 
42660 
42720 
42780 
42840 
42900 
42960 
43020 
43080 
43140 
43200 
43260 
43320 
43380 
43440 
43500 
43560 
43620 
43680 
43740 
43800 
43860 
43920 
43980 
44040 
44100 
44160 
44220 
44280 
44340 
44400 
44460 
44520 
44580 
44640 
44700 
44760 
44820 
44880 
44940 
45000 
45060 
45120 
45180 
45240 
45300 
45360 
45420 
45480 
45540 
45600 
45660 
45720 
45780 
45840 
45900 



4-30582A 



- 13- 



cggagctcgg tgcgagctac tgggcggaca accttcggca gccggtgcgc ttcgctgcgg 45960 
cggcgcaagc gctgctggag ggtggccccg cgctgttcat cgagatgagc ccgcacccga 46020 
tcctggtgcc gcccctggac gagatccaga cggcggccga gcaagggggc gctgcggtgg 46080 
gctcgctgcg gcgagggcag gacgagcgcg cgacgctgct ggaggcgctg gggacgctgt 46140 
gggcgtccgg ctatccggtg agctgggctc ggctgttccc cgcgggcggc aggcgggttc 46200 
cgctgccgac ctatccctgg cagcacgagc ggtgctggat cgaggtcgag cctgacgccc 46260 
gccgcctcgc cgcagccgac cccaccaagg actggttcta ccgaacggac tggcccgagg 46320 
tgccccgcgc cgccccgaaa tcggagacag ctcatgggag ctggctgctg ttggccgaca 46380 
ggggtggggt cggtgaggcg gtcgctgcag cgctgtcgac gcgcggactt tcctgcaccg 46440 
tgcttcatgc gtcggctgac gcctccaccg tcgccgagca ggtatccgaa gctgccagtc 46500 
gccgaaacga ctggcaggga gtcctctacc tgtggggcct cgacgccgtc gtcgatgctg 46560 
gggcatcggc cgacgaagtc agcgaggcta cccgccgtgc caccgcaccc gtccttgggc 46620 
tggttcgatt cctgagcgct gcgccccatc ctcctcgctt ctgggtggtg acccgcgggg 46680 
catgcacggt gggcggcgag ccagaggcct ctctttgcca agcggcgttg tggggcctcg 46740 
cgcgcgtcgc ggcgctggag caccccgctg cctggggtgg cctcgtggac ctggatcctc 46800 
agaagagccc gacggagatc gagcccctgg tggccgagct gctttcgccg gacgccgagg 46860 
atcaactggc gttccgcagc ggtcgcaggc acgcagcacg ccttgtagcc gccccgccgg 46920 
agggcgacgt cgcaccgata tcgctgtccg cggaggggag ctacctggtg acgggcgggc 46980 
tgggtggcct tggtctgctc gtggctcggt ggctggtgga gcggggagct cgacatctgg 47040 
tgctcaccag ccggcacggg ctgccagagc gacaggcgtc gggcggagag cagccgccgg 47100 
aggcccgcgc gcgcatcgca gcggtcgagg ggctggaagc gcagggcgcg cgggtgaccg 47160 
tggcagcggt ggatgtcgcc gaggccgatc ccatgacggc gctgctggcc gccatcgagc 47220 
ccccgttgcg cggggtggtg cacgccgccg gcgtcttccc cgtgcgtcac ctggcggaga 47280 
cggacgaggc cctgctggag tcggtgctcc gtcccaaggt ggccgggagc tggptgctgc 47340 
accggctgct gcgcgaccgg cctctcgacc tgttcgtgct gttctcgtcg ggcgcggcgg 47400 
tgtggggtgg caaaggccaa ggcgcatacg ccgcggccaa tgcgttcctc gacgggctcg 47460 
cgcaccatcg ccgcgcgcac tcgctgccgg cgttgagcct cgcctggggc ttatgggccg 47520 
agggaggcat ggttgatgca aaggctcatg cacgtctgag cgacatcggg gtcctgccca 47580 
tggccacggg gccggccttg tcggcgctgg agcgcctggt gaacaccagc gctgtccagc 47640 
gttcggtcac acggatggac tgggcgcgct tcgcgccggt ctatgccgcg cgagggcggc 47700 
gcaacttgct ttcggctctg gtcgcggagg acgagcgcgc tgcgtctccc ccggtgccga 47760 
cggcaaaccg gatctggcgc ggcctgtccg ttgcggagag ccgctcagcc ctctacgagc 47820 
tcgttcgcgg catcgtcgcc cgggtgctgg gcttctccga cccgggcgcg ctcgacgtcg 47880 
gccgaggctt cgccgagcag gggctcgact ccctgatggc tctggagatc cgtaaccgcc 47940 
ttcagcgcga gctgggcgaa cggctgtcgg cgactctggc cttcgaccac ccgacggtgg 48000 
agcggctggt ggcgcatctc ctcaccgacg tgctgaagct ggaggaccgg agcgacaccc 48060 
ggcacatccg gtcggtggcg gcggatgacg acatcgccat cgtcggtgcc gcctgccggt 4812Q 
tcccaggtgg ggatgagggc ctggagacat actggcggca tctggccgag ggcatggtgg 48180 
tcagcaccga ggtgccagcc gaccggtggc gcgcggcgga ctggtacgac cccgatccgg 48240 
aggttccggg ccggacctat gtggccaagg gtgccttcct ccgcgatgtg cgcagcttgg 48300 
atgcggcgtt cttcgccatt tcccctcgtg aggcgatgag cctggacccg caacagcggc 48360 
tgttgctgga ggtgagctgg gaggcgatcg agcgcgctgg ccaggacccg atggcgctgc 48420 
gcgagagcgc cacgggcgtg ttcgtgggca tgatcgggag cgagcacgcc gagcgggtgc 48480 
agggcctcga cgacgacgcg gcgttgctgt acggcaccac cggcaacctg ctcagcgtcg 48540 
ccgctggacg gctgtcgttc ttcctgggtc tgcacggccc gacgatgacg gtggacaccg 48600 
cctgctcgtc gtcgctggtg gcgttgcacc tcgcctgcca gagcctgcga ttgggcgagf 48660 
gcgaccaggc cctggccggc gggtccagcg tgcttttgtc gccgcggtca ttcgtcgcgg 48720 
cgtcgcgcat gcgtttgctt tcgccagatg ggcggtgcaa gacgttctcg gccgctgcag 487 80 
acggctttgc gcgggccgag ggctgcgccg tggtggtgct caagcggctc cgtgacgcgc 48840 
agcgcgaccg cgaccccatc, ctggcggtgg tcaggagcac ggcgatcaac cacgatggcc 48900 
cgagcagcgg gctcacggtg cccagcggtc ctgcccagca ggcgttgcta cgacaggcgc 48960 
tggcgcaagc gggcgtggcg ccggccgagg tcgatttcgt ggagtgccac gggacgggga 49020 
cagcgctggg tgacccgatc gaggtgcagg cgctgggcgc ggtgtacggg cggggccgcc 49080 
ccgcggagcg gccgctctgg ct-gggcgctg tcaaggccaa cctcggccac ctggaggccg 49140 
cggcgggctt ggccggcgtg ctcaaggtgc tcttggcgct ggagcacgag cagattccgg 49200 
ctcaaccgga gctcgacgag ctcaacccgc acatcccgtg ggcagagctg ccagtggccg 49260 
ttgtccgcag ggcggtcccc tggccgcgcg gcgcgcgccc gcgtcgtgca ggcgtgagcg 49320 
ctttcggcct gagcgggacc aacgcgcatg tggtgttgga ggaggcgccg gcggtggagc 49380 
ctgtggccgc ggcccccgag cgcgcagcgg agctgttcgt cctgtcggcg aagagcgcgg 49440 
cggcgctgga tgcgcaggca gcccggctgc gggaccacct ggagaagcat gtcgagcttg 49500 
gcctcggcga tgtggcgttc agcctggcga cgacgcgcag cgcgatggag caccggctgg 49560 
cggtggccgc gagctcgcgc gaggcgctgc gaggggcgct ttcggccgca gcgcaggggc 49620 
acacgccgcc gggagccgtg cgtgggcggg cctcgggcgg cagcgcgccg aaggtggtct 49680 
tcgtgtttcc cggccagggc tcgcagtggg tgggcatggg ccgaaagctc atggccgaag 49740 
agccggtctt ccgggcggcg ctggagggtt gcgaccgggc catcgaggcg gaagcgggct 49800 
ggtcgctgct cggggagctc tccgccgacg aggccgcctc gcagctcggg cgcatcgacg 49860 
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tggttcagcc ggtgctgttc gccatggaag tagcgctttc tgcgctgtgg cggtcgtggg 49920 
gagtggagcc ggaagcggtg gtgggccaca gcatgggcga ggttgcggcg gcgcacgtgg 49980 
ccggcgcgct gtcgctcgag gacgcggtgg cgatcatctg ccggcgcagc cggctgctgc 50040 
ggcggatcag cggtcagggg gagatggcgc tggtcgagct gtcgctggag gaggccgagg 50100 
cggcgctgcg tggccatgag ggtcggctga gcgtggcggt gagcaacagc ccgcgctcga 50160 
ccgtgctcgc cggcgagccg gcggcgctct cggaggtgct ggcggcgctg acggccaagg 50220 
gggtgttctg gcggcaggtg aaggtggacg tcgccagcca tagcccgcag gtcgacccgc 50280 
tgcgcgaaga gctgatcgcg gcgctgggag cgatccggcc gcgagcggct gcggtgccga 50340 
tgcgctcgac ggtgacgggc ggggtgatcg cgggtccgga gctcggtgcg agctactggg 50400 
cggacaacct tcggcagccg gtgcgcttcg ctgcggcggc gcaagcgctg ctggagggtg 50460 
gccccgcgct gttcatcgag atgagcccgc acccgatcct ggtgccgccc ctggacgaga 50520 
tccagacggc ggccgagcaa gggggcgctg cggtgggctc gctgcggcga gggcaggacg 50580 
agcgcgcgac gctgctggag gcgctgggga cgctgtgggc gtccggctat ccggtgagct 50640 
gggctcggct gttccccgcg ggcggcaggc gggttccgct gccgacctat ccctggcagc 50700 
acgagcggta ctggatcgag gacagcgtgc atgggtcgaa gccctcgctg cggcttcggc 50760 
agcttcgcaa cggcgccacg gaccatccgc tgctcggggc tccattgctc gtctcggcgc 50820 
gacccggagc tcacttgtgg gagcaagcgc tgagcgacga gaggctatcc tacctttcgg 50880 
aacatagggt ccatggcgaa gccgtgttgc ccagcgcggc gtatgtagag atggcgctcg 50940 
ccgccggcgt agatctctat ggcacggcga cgctggtgct ggagcagctg gcgctcgagc 51000 
gagccctcgc cgtgccctcc gaaggcggac gcatcgtgca agtggccctc agcgaagaag 51060 
gtcccggtcg ggcctcattc caggtatcga gtcgtgagga ggcaggtagg agctgggtgc 51120 
ggcacgccac ggggcacgtg tgtagcggcc agagctcagc ggtgggagcg ttgaaggaag 51180 
ctccgtggga gattcaacgg cgatgtccga gcgtcctgtc gtcggaggcg ctctatccgc 51240 
tgctcaacga gcacgccctc gactatggtc cctgcttcca gggcgtggag caggtgtggc 51300 
tcggcacggg ggaggtgctc ggccgggtac gcttgccagg agacatggca tcctcaagtg 51360 
gcgcctaccg gattcatccc gccttgttgg atgcatgttt tcaggtgctg acagcgctgc 51420 
tcaccacgcc ggaatccatc gagattcgga ggcggctgac ggatctccac gaaccggatc 51480 
tcccgcggtc cagggctccg gtgaatcaag cggtgagtga cacctggctg tgggacgccg 51540 
cgctggacgg tggacggcgc cagagcgcga gcgtgcccgt cgacctggtg ctcggcagct 51600 
tccatgcgaa gtgggaggtc atggagcgcc tcgcgcaggc gtacatcatc ggcactctcc 51660 
gcatatggaa cgtcttctgc gctgctggag agcgtcacac gatagacgag ttgctcgtca 51720 
ggcttcaaat ctctgtcgtc tacaggaagg tcatcaagcg atggatggaa caccttgtcg 51780 
cgatcggcat ccttgtaggg gacggagagc attttgtgag ctctcagccg ctgccggagc 51840 
ctgatttggc ggcggtgctc gaggaggccg ggagggtgtt cgccgacctc ccagtcctat 51900 
ttgagtggtg caagtttgcc ggggaacggc tcgcggacgt attgaccggt aagacgctcg 51960 
cgctcgagat cctcttccct ggtggctcgt tcgatatggc ggagcgaatc tatcgagatt 52020 
cgcccatcgc ccgttactcg aacggcatcg tgcgcggtgt cgtcgagtcg gcggcgcggg 52080 
tggtagcacc gtcgggaatg ttcagcatct tggagatcgg agcagggacg ggcgcgacca 52140 
ccgccgccgt cctcccggtg ttgctgcctg accggacgga gtaccatttc accgatgttt 52200 
ctccgctctt ccttgctcgc gcggagcaaa gatttcgaga ttatccattc ctgaagtatg 52260 
gcattctgga tgtcgaccag gagccagctg gccagggata cgcacatcag aggtttgacg 52320 
tcatcgtcgc ggccaatgtc atccatgcga cccgcgatat aagagccacg gcgaagcgtc 52380 
tcctgtcgtt gctcgcgccc ggaggccttc tggtgctggt cgagggcaca gggcatccga 52440 
tctggttcga tatcaccacg ggattgattg aggggtggca gaagtacgaa gatgatcttc 52500 
gtatcgacca tccgctcctg cctgctcgga cctggtgtga cgtcctgcgc cgggtaggct 52560 
ttgcggacgc cgtgagtctg ccaggcgacg gatctccggc ggggatcctc ggacagcacg 52620 
tgatcctctc gcgcgcgccg ggcatagcag gagccgcttg tgacagctcc ggtgagtcgg 52680 
cgaccgaatc gccggccgcg cgtgcagtac ggcaggaatg ggccgatggc tccgctgacg 52740 
tcgtccatcg gatggcgttg gagaggatgt acttccaccg ccggccgggc cggcaggttt 52800 
gggtccacgg tcgattgcgt accggtggag gcgcgttcac gaaggcgctc gctggagatc 52860 
tgctcctgtt cgaagacacc gggcaggtcg tggcagaggt tcaggggctc cgcctgccgc 52920 
agctcgaggc ttctgctttc gcgccgcggg acccgcggga agagtggttg tacgctttgg 52980 
aatggcagcg caaagaccct ataccagagg ctccggcagc cgcgtcttct tcctccgcgg 53040 
gggcttggct cgtgctgatg gaccagggcg ggacaggcgc tgcgctcgta tcgctgctgg 53100 
aagggcgagg cgaggcgtgc gtgcgcgtca tcgcgggtac ggcatacgcc tgcctcgcgc 53160 
cggggctgta tcaagtcgat ccggcgcagc cagatggctt tcataccctg ctccgcgatg 53220 
cattcggcga ggaccggatt tgtcgcgcgg tagtgcatat gtggagcctt gatgcgacgg 53280 
cagcagggga gagggcgaca gcggagtcgc ttcaggccga tcaactcctg gggagcctga 53340 
gcgcgctttc tctggtgcag gcgctggtgc gccggaggtg gcgcaacatg ccgcggcttt 53400 
ggctcttgac ccgcgccgtg catgcggtgg gcgcggagga cgcagcggcc tcggtggcgc 53460 
aggcgccggt gtggggcctc ggtcggacgc tcgcgctcga gcatccagag ctgcggtgca 53520 
cgctcgtgga cgtgaacccg gcgccgtctc cagaggacgc agccgcactg gcggtggagc 53580 
tcggggcgag cgacagagag gaccaggtcg cattgcgctc ggatggccgc tacgtggcgc 53640 
gcctcgtgcg gagctccttt tccggcaagc ctgctacgga ttgcggcatc cgggcggacg 53700 
gcagctatgt gatcaccgat ggcatgggga gagtggggct ctcggtcgcg caatggatgg 53760 
tgatgcaggg ggcccgccat gtggtgctcg tggatcgcgg cggcgcttcc gaggcatccc 53820 
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gggatgccct 
tggctcggcg 
ttcgggggat 
atgcccgtcg 
cgctgaccag 
tgggcttgcc 
atcaccggtg 
catcatcgcc 
gcctcacgct 
aggtaggggt 
gattggcgct 
acgcgtcgaa 
tggagaagca 
agcggcacgt 
accgcatcga 
acgtagcagc 
acgctccggc 
tgcggggcat 
cgcagatcgt 
tgagcatgcc 
M gcggaccgag 
rg tccggaggca 
gcgctgggcg 
caccgaggcg 
i ,a& gtcgctcgat 
:i g cggcatcgca 
■■.j cagcgactac 
; j| caccggcaat 
; ™t accctgcctg 
"'M ccgcagcctg 
ttcgtccaag 
i iA ccggacattc 
gctcaaacgg 
^ ttcggccatg 
I::* ggaggcgctc 
;y tgtcgagacc 
tgccgtgttg 
; f aaacctcggc 
^ ! tctgcaccac 
gatcgagggg 
accgcgcttc 
ggaggaggcg 
ggtgctgtcg 
catcgccgcg 
tagcccgatg 
gctggaggtt 
ctcgcccggc 
ccgtgggttg 
cttcgaccgg 
caggtcgtcg 
cgcgctggcc 
cctcggcgag 
cttggtggtc 
gatcgccgcg 
gatcgcggca 
gcagatcgcg 
cgcgttccac 
ggtgacgtac 
cgatgaggtg 
ggacggagtg 
gccgacgctg 
agcgtcgcgc 
ggtcgtcggt 
gctgccaacc 
ggcggacggc 
cgtgtcgacc 



ccggtccatg 
cgacgatgtc 
cgtgtacgtg 
cttcaaggag 
ggatagatcg 
aggacagggg 
caaggtgggc 
ggcgaccccg 
ggagcaggga 
gatgcggctg 
gtgggcggag 
cctgcgcgag 
cttgagcgag 
gccgttcagc 
ggccgcgctc 
tctgagcggg 
caccgagcgg 
gacggacgag 
tggtgagtaa 
cttctgctca 
ccgatcgcca 
ttctgggagc 
ctggtcggcg 
gtggacggct 
cctcagcaac 
ccccagtccc 
tcgcataccg 
acgctcagcg 
accgtcgaca 
cgcgctcgcg 
acgatgataa 
gacgcctcgg 
ctctccgacg 
aatcaggatg 
ttgcgcgagg 
cacggaacgg 
gggccggcgc 
cacctggagg 
gaactgatcc 
accgcgctcg 
gcgggggtga 
ccggccacgg 
gcgaagagcg 
tacccggagc 
gagcaccggc 
gcggcgcagg 
aagctcgcct 
tgggaggcgt 
gagctccatc 
ttgctggacc 
gcgctcttcc 
ctggtggccg 
gcgcgcggcc 
ccggaggccg 
gtcaatgggc 
gcggcgttcg 
tcgccgctca 
cggcggcctt 
agcgcgccgg 
aaggcgctgc 
ctcggccttg 
gccgggcgtg 
ggatcggtca 
tatccctggc 
accggccgtg 
catgccggtc 



gccgaggctg 
gctcggctcc 
gacgggacct 
tggatgtatc 
ctggacttct 
agccgcgccg 
cttacagcga 
aacgacggcg 
gcggcggcgc 
aatctgcgcc 
ctgctgaagg 
gcgctgcaga 
ctgttggggc 
aatctcggca 
ggcatcaccg 
agcttgctag 
gagaagagct 
cagaaggacg 
gggaccgagg 
tggacaagct 
tcgtaggcat 
tgctcgactc 
tccatcccag 
tcgacgccgc 
gcctgctgct 
tcgacggcag 
ttgcgcaaca 
tcgccgccgg 
cggcctgctc 
agagcgatct 
tgctggggcg 
ccaacgggtt 
cccagcgaca 
gccggtcgac 
cgctgcagag 
ggacctcgct 
gggccgatgg 
gcgctgcagg 
cgcgaaacct 
cgctggcgac 
gcgcgttcgg 
tgctcgcacc 
ccgccgcgct 
agggtctcgg 
tcgcggtggc 
ggcagacccc 
tcctgttcgc 
ggccggcgtt 
agccgctctg 
agacggcgtt 
ggtcgtgggg 
cctgcgtggc 
ggttgatgca 
acgtggctgc 
cggagcaggt 
cggcgcgggg 
tggatccgat 
cgatcgcgct 
gttactgggt 
acgcggccgg 
tgccggcctg 
acgaggctgc 
cctggtcggg 
agcgcgagcg 
ctcgggcggg 
tgcgcctgtg 



gcgcggaggt 
tctcgaagat 
tccagggcga 
ccaaggtgct 
tcgtcctgta 
ccggtgacgc 
tgagcatcaa 
gagcacggct 
tcgggcgctt 
agtggttgga 
agcgtgaccg 
gcgccaggcc 
gggggctgcg 

tggactcgct 
tgccggcgac 
acattctgtt 
tcgagaacga 
cgttgctcgc 
gagtatggcg 
tgcgaaaaag 
tggctgccgc 
aggccgagac 
cgaggaggtg 
gttctttggc 
ggaggtcacc 
ccgcaccggg 
gcggcgcgag 
acggttgtct 
gtcgtcgctc 
cgcgctggcg 
catccaggcg 
cgtccgtggg 
cggcgatcgg 
agggttgatg 
cgctcgcgtc 
cggcgacccg 
gagccgctgc 
cgtggcgggt 
ccatttccac 
ggagccggtg 
cctcagcggc 
ggcgacgccg 
ggacgcacag 
agacgtcgcg 
ggcgacctcg 
ggcaggcgcg 
cgggcagggc 
ccgcgagacc 
cgaggtgatg 
cacccagccg 
cgtggagccg 
gggtgtgttc 
ggcgctgccg 
cgcggtggcg 
ggtgatcgcg 
ggcgcgaacc 
gctggaggcg 
ggtgagcaac 
gcgtcacgcg 
tgcgggcctc 
cctgccggat 
gagcgcgcta 
tgtcttccct 
ttactggatc 
gggccacccc 
ggagacgacg 



gcagatcgtg 
cgaaccgtcg 
ctcctcgatg 
cggagcgtgg 
ttcctcgggc 
cttcttggac 
ctggggattg 
cgaataccgg 
gctcgcacga 
gttctatccc 
cgccgaccga 
cgaagatcgt 
ccttccgccg 
gataggcctg 
cctgctatgg 
tccgaatgcc 
tgccgcagat 
cgaaaagctg 
accacgaatg 
aacgcgtctt 
ttccccggcg 
gcggtccagc 
ccgcgctggg 
acctcgcctc 
tgggaagggc 
gtgttcctgg 
gagcaggacg 
tatacgctag 
gtggccatcc 
ggaggcgtca 
ctgtcgcccg 
gagggctgcg 
atctgggctc 
gcacccaatg 
gacgccgggg 
atcgaggtcg 
gtgctgggcg 
ttgatcaagg 
acgctcaatc 
ccgtggccgc 
accaacgtcc 
gggcgctcag 
gcggcgcggc 
ttcagcctgg 
cgcgaggcgc 
gcgcgcggca 
gcgcaggtgc 
ttcgaccggt 
tgggccgagc 
gcgctctttg 
gagctcgtcg 
tccctcgagg 
gccggcggcg 
ccgcacgcag 
ggcgccgaga 
aaaccgctgc 
ttccggcggg 
ctgagcggga 
cgagaggcgg 
ttcgtcgagg 
gccaggccgg 
gaggcgctgg 
tcgggcggac 
gaagcgccgg 
cttctgggtg 
ctggaccgaa 



gaggccgacg 
atgccgccgc 
ctggagctgg 
aacctgcacg 
acctcgcttc 
gccatcgcgc 
ctctccgaag 
gggatggaag 
cccagggcgc 
aacgcggccc 
ggcgcgtcga 
cagttgattc 
gagaggatcg 
gagctccgca 
acctacccta 



ggcgcgaccc 
ctcgaggctc 
gcgcagctcg 
ccgggaagct 
tggagcaaga 
gagcggacac 
cgctcgaccg 
ccggactgct 
gggaggcgcg 
tcgaggacgc 
gcgcatgcag 
catacgacat 
ggctgcaggg 
accttgcctg 
acatgctcct 
atggccactg 
gtatggtcgt 
tgatccgggg 
tgctcgctca 
ccatcggtta 
aggcgctgcg 
cagtgaagac 
cggcgctggc 
cgcggatccg 
gggcgggccg 
atgpcgtgct 
cggagctttt 
tctcagcgca 
tatcgacgcg 
tgcgaagcgc 
gggccgctto 
cgggcatggg 
gcgtcacgct 
cgggcagcag 
cgctggagta 
ctggccatag 
acgccgtgcg 
cgatggtatc 
cgttggtgtc 
aattcgtgca 
atgtctcgca 
tgactgagtc 
agccctgcac 
tgcgcttcgc 
tggggccgaa 
tgctgctccc 
gtgggttctg 
ggcgggtacc 
tcgatcgtga 
aagtcttttc 
agcggctgcc 



53880 
53940 
54000 
54060 
54120 
54180 
54240 
54300 
54360 
54420 
54480 
54540 
54600 
54660 
54720 
54780 
54840 
54900 
54960 
55020 
55080 
55140 
55200 
55260 
55320 
55380 
55440 
55500 
55560 
55620 
55680 
55740 
55800 
55860 
55920 
55980 
56040 
56100 
56160 
56220 
56280 
56340 
56400 
56460 
56520 
56580 
56640 
56700 
56760 
56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 



4-30582A 




16 



gtggctcggc 
gatggcgctg 
ggtgctcatc 
gaccgaggag 
tcgcgcgccc 
cccggcgagg 
ggctatctat 
cgccgagctg 
cggctccgcg 
tgttggcgcg 
ggtgcggctg 
tggtcaacag 
ggtggtcgcc 
cgacgcagac 
gatcacagcc 
ctcggcgctg 
tgcaggaatg 
gcacctcagc 
gctcgacgcg 
tggttgcgac 
gccgcggctg 
ggtgcaagcg 
ctgtatcagc 
cgagctactt 
tgcgcggctc 
tgacaggccg 
agccacgggg 
gctcgactcc 
agaaatcgag 
gggcgtgaac 
tacccatgtc 
cgaggcggcc 
ccacctgcag 
cgcggtgcga 
gaaccgtgcc 
gttcgtcaca 
gctttcgggc 
gaagctgggc 
gaatttttcc 
ccgtgcgctc 
gtcggggttg 
tcgcgcagcc 
cacgctggac 
cgcggacggc 
atggctggcc 
cgcagagcag 
gaaagcggac 
ggggatgccg 
gcagcagact 
cttgcacacg 
tgggcttttc 
cctttcgcat 
cacggaggtg 
gatgcggggc 
tcgcgtgcag 
aacagcggcc 
gaccgccggg 
ggggctgctg 
caagatcgag 
gctgcgcaac 
gtacccaacg 
tggcggcggg 
ccgctttcgt 
ctcgcccgag 
ggccatgtgg 
agaggcggcc 



gagcaccggg 
tcgtcggggg 
gagacgctga 
cgaccgggac 
ttccggatcc 
tcgaacctcg 
ggtgcgctcg 
tggcggggtg 
acagcctacc 
ttcgccgatc 
ttccagcggt 
gcctccagcc 
gagatctccc 
gactggttcc 
ggccggtggc 
aaggccgccg 
cgcgcgctcc 
agcctcgacg 
ccccggagcc 
agcgtgctct 
tggctcttga 
ccgctgttgg 
gtcgacctcg 
gcagatgatg 
gtccaccggc 
ttccggctag 
cggcgcgctc 
atcgacatcc 
ccgttggtgc 
ggccttgtgg 
accacgtcgg 
gcgatgcccc 
gcgggggagc 
tgggcgcagc 
tacctggagt 
gacgtgcatg 
gagcgcatcg 
aggcgcgacg 
ttctcgcagg 
ctcgacgagc 
cgcgttggcg 
gaggcattcc 
gacccggagg 
acctaccttg 
gagcggggcg 
cgagccgccg 
gtcgccgatc 
ctgcggggtg 
ccggcgcggt 
ctgacacgcg 
ggctcgccag 
caccgaaggg 
gggatggccg 
atcacccccg 
acgggggtga 
tcacggaggt 
gatcgggacc 
caggacgtcg 
gtggatgccc 
cgcatcgagg 
gtagcagcga 
tcggacacgg 
cctgtcgtca 
ggcttccgtt 
cacgatcgca 
tcgctgattc 



cgcaggggga 
ccgagatctt 
ccttcgcggg 
ggctgcggtt 
acgcccgcgg 
ccgccctgcg 
ccgagatggg 
agggcgaggc 
agctgcatcc 
gcgatgaggc 
ctcctgggga 
ggtggagcgc 
ggctggtggt 
tggagctgga 
tgctgctcgg 
gccatgtcgt 
tggccaacgc 
ggggcggcca 
cagatgtcga 
ccctggtgca 
cccgcggggc 
ggctgggccg 
atccagccga 
ccgaggagga 
tgcccgacgc 
agatcgatga 
ctggtccggg 
agctggcgtt 
tcggaagcga 
tgggccagcc 
ccacgctggt 
tcgcgtattt 
gggtgctgat 
gcgtgggcgc 
cgctgggcgt 
catggacgga 
acaagagcct 
actgcgccga 
tggacttgcg 
tgttcgggtt 
gatccctcac 
ggaggatggc 
tgcggatccg 
tgaccggcgg 
cggggcaact 
tggcggcgct 
ggtcacagat 
tcgtgcatgc 
tccgcacggt 
aagcgcctct 
gccagggcaa 
cgcagggcct 
ttgcgcaaga 
atgagggtct 
taccgatcac 
tgtcgcggct 
tgctcgaaca 
tgcgcgtgca 
cgctctcgag 
ctgcgctggg 
taacgcgctg 
acgaatcgac 
agccgcgggc 
cctggtcgga 
gcctcgcctc 
agcactatgc 



ggtcgtgttt 
gggcgatgga 
cgatacggcg 
ccaggtagcg 
cgtgctgcgc 
cgcccggctt 
gcttcaatac 
gctgggcagg 
ggtgctgctg 
gacgccgtgg 
gctatggtgc 
cgactttgag 
ggagcggctt 
ttgggagccc 
cgagggtggt 
cgtccacgcc 
gttcgacggc 
gctcggcccg 
tgccgatgcc 
agcgctggtc 
tcaggcggcc 
caccatcgcc 
gcctgaaggg 
ggtcgcgctg 
tcagcgccgg 
acccggcgcg 
cgaggtcgag 
gggcgttgct 
gtgcgccggg 
ggtgatcgcc 
gttgcctcgg 
gacggcctgg 
ccatgcggag 
cgaggtgtat 
gcggtacgtg 
cggcgagggt 
catggtcctg 
cacgcagcct 
gggaatgatg 
ggtcgcagcc 
gccaccgccg 
gcaaggacag 
cgctccggcc 
tctgggtggc 
ggtgctggtg 
ggaggcccac 
cgagcgggtc 
ggcaggtctc 
gatgggacct 
ttccttcttc 
ctatgccgca 
gccggcgctg 
aaaccgtggc 
gtcagctctg 
tccgcggcag 
ggtgaccacg 
gcttgcgtcg 
ggtctcgcat 
catgggcatg 
cgtcgccgcg 
gctgctcgac 
ggcgagcgcc 
tcgtctcttc 
gaagtctgag 
cgaggacgcg 
agacgcaccg 



cctggcgccg 
ccgatccagg 
gtaccggtcc 
agtcgggagc 
cggatcgggc 
catgccgccg 
ggcccggcgt 
gtgagactgc 
gacgcgtgcg 
gcgccggtgg 
catgcgcgcg 
ttgatggacg 
gcgagcggtg 
gcggcgctcg 
gggctcgggc 
gcgggggacg 
caggccccga 
gggctcgggg 
ctcgaatcgg 
ggcatggacc 
gccgccggcg 
ttggagcacg 
gaagccgatg 
cgcggtggcg 
gagaaggtcg 
ctggaccaac 
atctccgtcg 
cccaatgatc 
cgcatcgtcg 
cttgcggcgg 
cctctggggc 
tacgccctcg 
gccggtggtg 
gcgaccgccg 
agcgattccc 
gtggacgtcg 
cgcgcctgtg 
gggctgccgc 
ctcgatcaac 
ggtgccatca 
gtcgagacct 
catctcggga 
gaatccagcg 
ctcggtctgc 
ggccgctccg 
ggcgcgcgcg 
ctccgcgagg 
gtggatgacg 
aaggtccagg 
gtgctgtacg 
gccaacgcgt 
agcatcgact 
gcgcggcaga 
gcgcgcttgc 
tgggtggagt 
cagcgcgcgg 
gctgagccga 
gtgctgcgtc 
gactcgctga 
cctgcagcct 
gacgccctcg 
ggttcgttcg 
tgttttcacg 
tggagcgatc 
cctggtaaga 
tttgcgttag 



ggtacctgga 
tcacggatgt 
aggtggtgac 
cgggggaacg 
gcgtcgagac 
tgcccgctgc 
tgcgggggct 
ctgaggccgc 
tccaaatgat 
aggtgggctc 
tcgtgagcga 
gtacgggcgc 
tacgccggcg 
gtgggcccaa 
gctcgttgtg 
acacgagcac 
cggccgtggt 
cgcagggcgc 
cgctgatgcg 
tccgaaacgc 
atgtctccgt 
ccgagctgcg 
cttpgctggc 
accggctcgt 
agcccgccgg 
tggtgctccg 
aagcggcggg 
tgcctggaga 
ctgtgggcga 
gagtatttgc 
tctcggcgac 
acaaggtcgc 
tcggtctttg 
acacgcccga 
gctcgggccg 
tgctcgactc 
gtcgccttgt 
cgctcctacg 
cggcgaggat 
gcccactggg 
tcccgatctc 
agctcgtgct 
tcgccgtccg 
gcgtggccgg 
gtgcggcgag 
tcacggtggo 
ttaccgcgtc 
ggctgctgat 
gggccttgca 
cttctgcagc 
tcctcgacgc 

ggggcatgtt 

tctctcgcgg 
tcgagggtga 
tctacccggc 
tcgctgatcg 
gcgcgcgggc 
tccctgaaga 
tgagcctgga 
tggggtggac 
tcgtccggct 
tccacgtcct 
gttctggcgg 
tggaaatcgt 
agtacgtcca 
tagggttcag 



57840 
57900 . 
57960 
58020 
58080 
58140 
58200 
58260 
58320 
58380 
58440 
58500 
58560 
58620 
58680 
58740 
58800 
58860 
58920 
58980 
59040 
59100 
59160 
59220 
59280 
59340 
59400 
59460 
59520 
59580 
59640 
59700 
59760 
59820 
59880 
59940 
60000 
60060 
60120 
60180 
60240 
60300 
60360 
60420 
60480 
60540 
60600 
60660 
60720 
60780 
60840 
60900 
60960 
61020 
61080 
61140 
61200 
61260 
61320 
61380 
61440 
61500 
61560 
61620 
61680 
61740 
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cctgggtgtc 
ggctccgctg 
ggagatggag 
atccacccaa 
ggctccggcc 
catcgccggc 
caccacggag 
agggcgcgag 
gacgtcgtca 
tggttgggag 
cctggacgtt 
gaagcctgct 
gatcgagcgc 
cctcacccga 
agaagagtgg 
gaagtacgga 
cccgtcgttt 
ccagctgctc 
gggaatcccg 
gttccgtcgc 
cgatgaggag 
cgtcctcgat 
ggccgaggcc 
tatcgctgct 
gcggtcgccc 
cgatgaggtg 
ggacctggag 
gagcgccctg 
cacgggcgcg 
tcgcctcgag 
gaaagaaact 
catcttgaag 
cctcatgata 
caagcccatt 
gacgtcggaa 
gcacgccgct 
cggcctcacc 
cgcgcgcgcc 
cttcggcggc 
agtcctcttc 
gctcgtcctg 
gcgccccggg 
gctggtgcag 
tgcgcaagcc 
gacatccggc 
gaggccgacg 
cggtgggcgg 
cggctgggca 
atcgccggca 
ttccgcgcgg 
gccggcccgt 
gcggcgcgcc 
ctgcgcgcag 
gcctctctcg 
cggagccgcg 
acaggcgacg 
ctccagacac 
ctcgtcgcgc 
gtcttgtccg 
gacggcgctg 
cccgcctgac 
caggcccacg 
cacatcagag 
accgtccccg 
gactagcgtg 
atcggccggg 



cggttcgtca 
gccgtcttca 
accgatataa 
caagtccagg 
cccggggact 
tcggacgatg 
cgcttctata 
atcatgcaca 
ggccccgcgt 
cagcgtgggc 
tgcagtatag 
ttcgacttca 
ctgagagagg 
taccacgacg 
gaatcgagcg 
ttgttcgggc 
acgtcacgcg 
gatgctcgct 
atgcgcgcga 
ttcggctcgg 
accaagaccc 
gagcggcgca 
gacggcagca 
ggcaccgata 
gaggcgctcg 
ctccgcttcg 
tactgcgggg 
agagatggga 
agcctcgcgt 
gcggagatcg 
cccgtgtttg 
ccctccaaag 
cagctcgcgc 
gtcagcgcgc 
gaagtgccgg 
cgtcatgtcc 
gagcggcagg 
ttcggcgagc 
gtggtgctgg 
caggatccgg 
ctgctcatgg 
gcgctctcgg 
cgcatgcagg 
tgagcctcgg 
cgcccgccgc 
aggcgctccg 
aggcggcgca 
aggcgctcga 
gactccagaa 
cgacggagca 
accgcggatc 
ttcaccccgc 
ccgagcgggc 
ggcgcagccg 
ctcggtccgc 
acccgcccga 
tcgttcagct 
cactcggatc 
atgttgttgc 
ccgactggca 
cgcttttcgc 
accagcttcc 
actctccgct 
accagaacag 
cctccgctcg 
caggaggtac 



tggggacagc 
cgttgggcgg 
tagccaagct 
ccgatgctcg 
cgaaggagcc 
tgatcgtgcc 
tgcatctcct 
tcgtcgactc 
tcgaggcaaa 
gctggcggcc 
gagattttat 
agccgttcgc 
caacccccat 
tgtcggcggt 
cggagtactc 
tgccgccgga 
ccatcgacct 
ccggacaaga 
tcagcgctct 
cgactgcgcg 
tggtcgcgtc 
ggaacccgct 
ggctgagcac 
ccacgatcta 
agctggtgaa 
acaatatcct 
catcgatcaa 
ctgtattctc 
acggtagagg 
ccgtgggcac 
gataccaccc 
ctggatagct 
gcgggtgctg 
gaagatcgaa 
gcgccgccct 
ggccctgcac 
ttctgctctc 
tcgcgcggcg 
gcccgtccgt 
cggtcggggt 
cgggtatcga 
cgctcggcgc 
gcgcgttcac 
cgcctgctcg 
ggcccagctc 
cccgttccgc 
gcggccgcgg 
caaggtccct 
cgatgcggtc 
cgcgctccgc 
gagccgcgtg 
ggaccgtgta 
gctcatcgcg 
agcggcggcg 
gcagcggcta 
gggtgtcgaa 
gcttggcgtc 
ggatcttgtt 
accgcgcctc 
ccggcgcctc 
cgccgcacgc 
cagcaatctt 
cggctcgtcg 
ccgcatgcgg 
tgccgagatc 
cgacacgggc 



cgtggagctc 
cagcttgatc 
cttcttccga 
cgcagacaag 
gcccgtgaag 
tccgagcgac 
tcccggagat 
gcatctcaat 
atgatggcag 
ggcggcaggc 
gacacaggag 
gcctgggtac 
cttctactgg 
gttccgcgac 
gtcggccatt 
ggatcacgct 
gctgcgcgcc 
ggagttcgac 
gttgaaggtt 
cgcgctcggc 
cgtcaccgag 
cgaaaatgac 
gaaggagctg 
ccttatcgcg 
ggccgagccc 
cagaatagga 
gaaaggggag 
caggccagac 
cccccatgtc 
catcttccgt 
cgcgttccgg 
cgcgggggta 
tctgccgcgg 
tccacggccc 
gggagcgcaa 
ccgcgccgag 
gctcgtcgcc 
gctgcgccag 
cgtcggcgcg 
cgtgctctcc 
ggtcgatgtg 
gatcgcgccc 
gtgggatctc 
tacacctcgc 
gagccggact 
gacgcgatcg 
ctggagagcc 
ttcgcgcaca 
tggttcgatg 
gacgcggcgt 
tccgctgccg 
cccgcgtccg 
ctctacactg 
tgccggtggt 
gcgcgcgtcg 
cggattgccg 
gatgccgcct 
cgagcacgcg 
gcggtcgcac 
gccctgcgcg 
cgcgagcagg 
ttgcatggct 
gttcgacagc 
gtttctcgca 
ggctgtcctg 
gccgggcggg 



gccagtcgtt 
tcttcttcag 
aatgccgcgg 
gtcatcacag 
atcgcggtcc 
gttcaggatc 
cacgaatttc 
ccgctgctcg 
cctccctcgg 
cgcggaggcg 
caagcgaatc 
gcggaggacc 
gatgaaggcc 
gaacgcttcg 
cccgagctca 
cgggtccgca 
gaaatacagc 
gttgtgcggg 
ccggccgagt 
gtgggtttgg 
gggctcgcgc 
gtcttgacga 
gtcgcgctcg 
ttcgctgtgc 
gggctcatga 
actgtgcgtt 
atggtctttc 
gtgtttgatg 
tgccccgggg 
aggttccccg 
aacatcgaat 
tcgcttcccg 
gtgcgattcg 
ggagaagagc 
agctcgctcg 
gagccgcccg 
ctcgcgctcg 
cccgaggtgc 
ctcgctcctg 
ggcatctcct 
agcatcctgc 
ccgctgcgca 
gacgtctcgc 
cggtgctcgc 
cgccggatga 
ccgcgtactc 
tcgtgcggct 
cgacggccgg 
tcgccgcccg 
cggccatgga 
taggggagtt 
accagcagat 
cgttcgcccg 
tccctcttcg 
cggcagagat 
cagccctcat 
gggcactcgc 
tccttgctcg 
cgcgccgcca 
ccacccgggg 
ctcattcccg 
tcccctccct 



cggcgacggc 
acatgccccg 
tgcgacggca 
aggtgccgcc 



ccggcgcacc 
agatcacccc 
gtttcgtgcg 
acaccatggt 
ctatcgtcgc 
tacaatctcg 
tcgtcgatcg 
ccgcgaggac 
gcgcgcgaga 
catgagcctt 
agagtgagac 
cgttccccgc 
gctcctgggt 
cggtcagtcg 
gcgatatgaa 
agctcgtcaa 
gcaccgtcga 
attacgcgga 
gtgacgagaa 
tgccccaggt 
tgctccatga 
tgctgcttca 
tgggtgcgat 
tcaacctgct 
ggaacgcgct 
tcgccaggca 
tcctgatccc 
tgcgacggga 
tgtcccttgc 
agatgaagct 
cactcaacgt 
aacctcattc 
atccagcgga 
ccgtccgggt 
ttcgcgctca 
ccctgatgca 
tcctcctgac 
tcggcgagct 
ggttccatcg 
ggataggcgc 
gcaaggaggc 
cgccggggcc 
cgcgacgctc 
tccgcccgcg 
cgaggccgac 
ggaggccgtt* 
cgcgatcgtg 
cgtctcccag 
gtacgcgagc 
ggcgctcgcg 
tcggggggag 
cctgaccgcg 
tgaggagtga 
caaccatgac 
cgctggagcg 
tgcggatccc 
cgaaggtcag 
aatactcccg 
cgatgctatc 
tttgcgcctc 
acaccgagat 
cacgacacgt 
cacgagcaga 
acatccttgc 
atatcctgcg 
acgggctcga 



61800 
61860 
61920 
61980 
62040 
62100 
62160 
62220 
62280 
62340 
62400 
62460 
62520 
62580 
62640 
62700 
62760 
62820 
62880 
62940 
63000 
63060 
63120 
63180 
63240 
63300 
63360 
63420 
63480 
63540 
63600 
63660 
63720 
63780 
63840 
63900 
63960 
64020 
64080 
64140 
64200 
64260 
64320 
64380 
64440 
64500 
64560 
64620 
64680 
64740 
64800 
64860 
64920 
64980 
65040 
65100 
65160 
65220 
65280 
65340 
65400 
65460 
65520 
65580 
65640 
65700 



4-30582A 



-18 



aatgtgctgc 
ggggtgacgc 
caccaccgtg 
tccgtgtagg 
tgctgcacga 
atcggctcgg 
gtcacccggt 
gcgtcccggt 
ggtccgttgc 
ggcttctcca 
agcgacggcc 
ctccttgccg 
gatgccggcg 
cggcgcctct 
acccgccgat 
ggcggctgtg 
gaagcagtga 
cgccggatag 
ccgagcgaga 
gccgacgtcg 
ccgcccgagc 
gagtccccgt 
ctggccgaga 
agcagccctc 
cgcgctatca 
tccatcatat 
cggcgcggct 
ggatcggcgg 
cgtggcgatc 
ctccgagggc 
gcaccggagc 
tttctggacg 
cgaagggatc 
cgtcctcggc 
caagaagaag 
gttcacgatc 
cgtctggctc 
cggcctcgtg 

gggggagagc 
tggcgtgtcg 
cgtgctgcgc 
gcgcatcgag 
caggtcgctc 
cgccatcctc 
aaaccgtgca 
tcgcgaggtc 
caggcttcga 
ggcgagatga 
agcgcggtcg 
gcggcgccag 
atcctccgct 



ggcaggcgcc 
cgcaaacggg 
cgcactcgtg 
cgatcgtgct 
caatgggaac 
gttcggtcag 
acggcccggc 
ccgacgcatt 
tccggcctgg 
tatgtcctcc 
tcttctcccg 
accggcgcgc 
cgacgaggcc 
gcaggccggc 
tcacatccca 
gtaccggcca 
ggcgatcgag 
ggctcacgct 
aaatcgtgca 
accccgacgc 
gcatcgaggc 
ggcaacgggt 
cgaccgtgcc 
agtctccatc 
gcagcgctga 
catccctgcg 
ctacgtgcgc 
agcatggaga 
gcggcggtca 
gtgcactccc 
gcacgcccgc 
ctgatcgtcg 
ttgcacctct 
gcagcggccg 
gacggacagg 
gtcctggagg 
gggcaccgcc 
ctcgccgcgg 
gcggacaggg 
gcggtggggc 
atcgagttcg 
acccggatac 
caccagcgcg 
cgcggcgccc 
gacgtcgtcc 
cggcagcgcc 
ggggggcgac 
cggctggcgt 
ctcgcgcgat 
catggcggtc 
tccagggttc 



tccatgcccg 
agatgctccc 
agctccagct 
gatcagcgcg 
gtccgattcg 
atcgttgaac 
ggggtcgcgg 
caacaggcag 
gatgtagccc 
ctgctggctc 
acgcgctcgg 
cgagcgccga 
gccccgcgtc 
ctgagcgttg 
ccgcggcacg 
gcccggacgc 
gtggcagatg 
cggtctcctc 
gggcacgcgg 
cgcgaccacg 
cggcagcgag 
cggagtgctc 
gcatgccaac 
ttctgccgcc 
gcccgccagc 
tactcctcca 



gacaggagag 
aagaatcgag 
agttcatcgc 
tcgtcgatac 
ccgacgccga 
ccatcatgat 
tgcacccgcg 
tcttcgaggg 
gctacctcgc 
actccgcggc 
tgggaaaccc 
tcgcggtctt 
agctcctcgc 
ggcccctgac 
acgccgcgct 
ggagcgagcg 
cgagggcgtg 
gggctcaggt 
ttcgacgcga 
ggagcccggg 
gtcatgagga 
cgcggcctcc 
ctcgcccggc 
gtctcgctgt 
gattcggcca 



cagccgggaa 
ggagaggcgc 
cctcggcata 
ttctccgcct 
atcacgctgg 
cggacgtgcc 
tcgctgaagt 
gccgtctcat 
tctgcgattg 
ctctttggct 
ggatccatgg 
cgggctttga 
tgatcccgat 
cgcggtcatg 
acgcttgctc 
gaggcccgag 
aaacacgttg 
gcgagcatgg 
ctcgcgcccg 
cggctggcgg 
cggttcgtcg 
gactacaacg 
ttcgagctgc 
gtcatcgggc 
aggccccaga 
gcgacggccg 
cgtcctggcg 
gatcgcgatc 
cgccgccgtg 
tgcagacggg 
gcatccgttc 
cttcgccgcg 
ccagatcgag 
gacgtcgctc 
ggcgatgcgg 
gctcgccggg 
ctacctcgac 
cctcgccagc 
cgcgatccgc 
gatgcacttc 
cacggcgtcc 
acccgacgtg 
acgcgccgtg 
ggccctcgca 
ggtacgctgg 
cgggccgggc 
aggccagggc 
ccggcgccgg 
gccggctcat 
tccagctcgg 
aggtgacgag 



cgcggcgccc 
cgggcacagc 
gaagagaccg 
gacgcgagtc 
catagtccgt 
gggtgcgcct 
agacggtgat 
ggctcgtcat 
cccagcgcgt 
gcctccctct 
ctgaggatcc 
aagcacgcga 
cgtgacatcg 
gtcgtcctcg 
aaaccgcggc 
agggacagtg 
acacgggccg 
cgctcgccgg 
gcgccgatgc 
tggacgtcgt 
tctggcagcg 
ctgccagccg 
tcatcaccgt 
cgacgtccgt 
gccctgcctc 
cgtcgaagca 
cggcctgcgc 
tacggcgcca 
accggcagct 
ctcctcctcc 
ggccacggca 
ggcggcggcg 
gatccgacgt 
atcatctcga 
tccagcaagg 
ctcaccatcg 
ggcgcggcgt 
cagagccgtg 
gcgctcgcca 
ggtccgcacg 
ggggtcgcgg 
aagcacatct 
gagagaccgc 
gcagggcgcg 
ttgcaagtcg 
gcacgaaggc 
gcatggggcg 
cgcgcttcgc 
cgcctccgtg 
gatcatcgag 
ctccgatatc 



ggccagcctc 
cgagcgccgt 
tcactcccgg 
gagccgggta 
atcgcgcggg 
cgctgggacg 
ggcgacctgc 
ctgcggctcg 
ccgcccgatc 
gctgtccagg 
tcgccgagcg 
ccggacacgt 
cgacgtccgc 
cgtcaccgcc 
gagacggccg 
ggticcgccgt 
acgagtcggc 
ctgtggcggc 
gcacgtcgcc 
tcacctctcg 
tccgagctcc 
aagaggcaag 
cgagaagcag 
cgggtaacat 
gatcgccttc 
accgccgtgc 
atcgctggaa 
tcgcagccaa 
cggcgatgct 
tgctcggcaa 
aggagctcta 
tctcgatcta 
ggaactacgt 
tccacgagtt 
acccgacgac 
ccttcctcgg 
cgatcggcat 
ggctcctcgt 
gcgcagatcc 
aagtcctggt 
aggcgatcga 
acgpcgaggc 
gcgcggcctc 
cctggcgggc 
tcacgccgta 
gcggcgagcg* 
atgctcggcg 
gccgcgctcc 
tccctcgccg 
cgcctgcccg 



65760 
65820 
65880 
65940 
66000 
66060 
66120 
66180 
66240 
66300 
66360 
66420 
66480 
66540 
66600 
66660 
66720 
66780 
66840 
66900 
66960 
67020 
67080 
67140 
67200 
67260 
67320 
67380 
67440 
67500 
67560 
67620 
67680 
67740 
67800 
67860 
67920 
67980 
68040 
68100 
68160 
68220 
68280 
68340 
68400 
68460 
68520 
68580 
68640 
68700 
68750 



<210> 2 
<211> 1421 
<212> PRT 

<213> Sorangium cellulosum 
<400> 2 

Val Ala Asp Arg Pro lie Glu Arg Ala Ala Glu Asp Pro He Ala He 
15 10 15 

Val Gly Ala Ser Cys Arg Leu Pro Gly Gly Val He Asp Leu Ser Gly 
20 25 30 

Phe Trp Thr Leu Leu Glu Gly Ser Arg Asp Thr Val Gly Arg Val Pro 
35 40 45 
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Ala Glu Arg Trp Asp Ala Ala Ala Trp Phe Asp Pro Asp Pro Asp Ala 
50 ~ " 55 60 

Pro Gly Lys Thr Pro Val Thr Arg Ala Ser Phe Leu Ser Asp Val Ala 
65 70 75 80 

Cys Phe Asp Ala Ser Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Arg 
85 90 95 

Met Asp Pro Ala His Arg Leu Leu Leu Glu Val Cys Trp Glu Ala Leu 
100 105 11° 

Glu Asn Ala Ala lie Ala Pro Ser Ala Leu Val Gly Thr Glu Thr Gly 
115 120 I 25 

Val Phe lie Gly He Gly Pro Ser Glu Tyr Glu Ala Ala Leu Pro Gin 
130 " 135 140 

Ala Thr Ala Ser Ala Glu He Asp Ala His Gly Gly Leu Gly Thr Met 
145 150 155 160 

Pro Ser Val Gly Ala Gly Arg He Ser Tyr Ala Leu Gly Leu Arg Gly 
165 170 I 75 

Pro Cys Val Ala Val Asp Thr Ala Tyr Ser Ser Ser Leu Val Ala Val 
180 185 19° 

His Leu Ala Cys Gin Ser Leu Arg Ser Gly Glu Cys Ser Thr Ala Leu 
195 200 205 

Ala Gly Gly Val Ser Leu Met Leu Ser Pro Ser Thr Leu Val Trp Leu 
210 215 220 

Ser Lys Thr Arg Ala Leu Ala Arg Asp Gly Arg Cys Lys Ala Phe Ser 
225 230 235 240 

Ala Glu Ala Asp Gly Phe Gly Arg Gly Glu Gly Cys Ala Val Val Val 
245 250 255 

Leu Lys Arg Leu Ser Gly Ala Arg Ala Asp Gly Asp Arg lie Leu Ala 
260 265 270 

Val He Arg Gly Ser Ala He Asn His Asp Gly Ala Ser Ser Gly Leu 
275 280 285 

Thr Val Pro Asn Gly Ser Ser Gin Glu He Val Leu Lys Arg Ala Leu 
290 295 300 

Ala Asp Ala Gly Cys Ala Ala Ser Ser Val Gly Tyr Val Glu Ala His 
305 310 315 

Gly Thr Gly Thr Thr Leu Gly Asp Pro He Glu He Gin Ala Leu Asn 
325 330 335 

Ala Val Tyr Gly Leu Gly Arg- Asp Val Ala Thr Pro Leu Leu He Gly 
340 345 350 

Ser Val Lys Thr Asn Leu Gly His Pro Glu Tyr Ala Ser Gly He Thr 
355 360 365 

Gly Leu Leu Lys Val Val Leu Ser Leu Gin His Gly Gin He Pro Ala 
370 375 380 

His Leu His Ala Gin Ala Leu Asn Pro Arg He Ser Trp Gly Asp Leu 
385 390 395 400 
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Arg Leu Thr Val Thr Arg Ala Arg Thr Pro Trp Pro Asp Trp Asn Thr 
405 410 415 

Pro Ara Arg Ala Gly Val Ser Ser Phe Gly Met Ser Gly Thr Asn Ala 
420 425 430 

His Val Val Leu Glu Glu Ala Pro Ala Ala Thr Cys Thr Pro Pro Ala 
435 440 445 

Pro Glu Arg Pro Ala Glu Leu Leu Val Leu Ser Ala Arg Thr Ala Ser 
450 ~ 455 460 

Ala Leu Asp Ala Gin Ala Ala Arg Leu Arg Asp His Leu Glu Thr Tyr 
465 ~ 470 475 480 

Pro Ser Gin Cys Leu Gly Asp Val Ala Phe Ser Leu Ala Thr Thr Arg 
485 490 495 

Ser Ala Met Glu His Arg Leu Ala Val Ala Ala Thr Ser Arg Glu Gly 
500 505 510 

Leu Arg Ala Ala Leu Asp Ala Ala Ala Gin Gly Gin Thr Ser Pro Gly 
515 520 525 

Ala Val Arg Ser He Ala Asp Ser Ser Arg Gly Lys Leu Ala Phe Leu 
530 " 535 540 

Phe Thr Gly Gin Gly Ala Gin Thr Leu Gly Met Gly Arg Gly Leu Tyr 
545 550 555 560 

Asp Val Trp Ser Ala Phe Arg Glu Ala Phe Asp Leu Cys Val Arg Leu 
565 570 575 

Phe Asn Gin Glu Leu Asp Arg Pro Leu Arg Glu Val Met Trp Ala Glu 
580 585 590 

Pro Ala Ser Val Asp Ala Ala Leu Leu Asp Gin Thr Ala Phe Thr Gin 
595 600 605 

Pro Ala Leu Phe Thr Phe Glu Tyr Ala Leu Ala Ala Leu Trp Arg Ser 
610 615 620 

Trp Gly Val Glu Pro Glu Leu Val Ala Gly His Ser He Gly Glu Leu 
625 630 635 640 

Val Ala Ala Cys Val Ala Gly Val Phe Ser Leu Glu Asp Ala Val Phe 
645 650 655 

Leu Val Ala Ala Arg Gly Arg Leu Met Gin Ala Leu Pro Ala Gly Gly 
660 ~ 665 670 

Ala Met Val Ser He Glu Ala Pro Glu Ala Asp Val Ala Ala Ala Val 
675 680 685 

Ala Pro His Ala Ala Ser Val Ser He Ala Ala Val Asn Ala Pro Asp 
690 695 700 

Gin Val Val He Ala Gly Ala Gly Gin Pro Val His Ala He Ala Ala 
705 710 715 720 

Ala Met Ala Ala Arg Gly Ala Arg Thr Lys Ala Leu His Val Ser His 
725 730 735 

Ala Phe His Ser Pro Leu Met Ala Pro Met Leu Glu Ala Phe Gly Arg 
740. 745 750 
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Val Ala Glu Ser Val Ser Tyr Arg Arg Pro Ser He Val Leu Val Ser 
755 760 765 

Asn Leu Ser Gly Lys Ala Cys Thr Asp Glu Val Ser Ser Pro Gly Tyr 
770 " 775 780 

Trp Val Arg His Ala Arg Glu Val Val Arg Phe Ala Asp Gly Val Lys 
785 790 795 800 

Ala Leu His Ala Ala Gly Ala Gly Thr Phe Val Glu Val Gly Pro Lys 
805 810 815 

Ser Thr Leu Leu Gly Leu Val Pro Ala Cys Met Pro Asp Ala Arg Pro 
820 825 830 

Ala Leu Leu Ala Ser Ser Arg Ala Gly Arg Asp Glu Pro Ala Thr Val 
835 840 845 

Leu Glu Ala Leu Gly Gly Leu Trp Ala Val Gly Gly Leu Val Ser Trp 

|:A 850 855 860 

~ Ala Gly Leu Phe Pro Ser Gly Gly Arg Arg Val Pro Leu Pro Thr Tyr 
U 865 870 875 880 

U 

£ Pro Trp Gin Arg Glu Arg Tyr Trp He Asp Thr Lys Ala Asp Asp Ala 

885 890 895 

'■-«-! 

Ala Arg Gly Asp Arg Arg Ala Pro Gly Ala Gly His Asp Glu Val Glu 
S| " 900 905 910 

, Glu Gly Gly Ala Val Arg Gly Gly Asp Arg Arg Ser Ala Arg Leu Asp 

; - 915 920 925 

^ His Pro Pro Pro Glu Ser Gly Arg Arg Glu Lys Val Glu Ala Ala Gly 

iy 930 935 940 

1:3 Asp Arg Pro Phe Arg Leu Glu He Asp Glu Pro Gly Val Leu Asp His 

M 945 950 955 960 

Leu Val Leu Arg Val Thr Glu Arg Arg Ala Pro Gly Leu Gly Glu Val 
. 965 970 975 

Glu He Ala Val Asp Ala Ala Gly Leu Ser Phe Asn Asp Val Gin Leu 
980 985 990 

Ala Leu Gly Met Val Pro Asp Asp Leu Pro Gly Lys Pro Asn Pro Pro 
995 1000 1005 

Leu Leu Leu Gly Gly Glu Cys Ala Gly Arg He Val Ala Val Gly Glu 
1010 1015 1020 

Gly Val Asn Gly Leu Val Val Gly Gin Pro Val He Ala Leu Ser Ala 
1025 1030 1035 1040 

Gly Ala Phe Ala Thr His Val Thr Thr Ser Ala Ala Leu Val Leu Pro 
1045 1050 1055 

Arg Pro Gin Ala Leu Ser Ala He Glu Ala Ala Ala Met Pro Val Ala 
1060 1065 1070 • 

Tyr Leu Thr Ala Trp Tyr Ala Leu Asp Arg He Ala Arg Leu Gin Pro 
1075 1080 1085 

Glv Glu Arg Val Leu He His Ala Ala Thr Gly Gly Val Gly Leu Ala 
1090 1095 1100 



1 
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Ala Val Gin Trp Ala Gin His Val Gly Ala Glu Val His Ala Thr Ala 
1105 1110 1115 H20 

Glv Thr Pro Glu Lys Arg Ala Tyr Leu Glu Ser Leu Gly Val Arg Tyr 
y 1125 1130 1135 

Val Ser Asp Ser Arg Ser Asp Arg Phe Val Ala Asp Val Arg Ala Trp 
1140 1145 1150 

Thr Gly Gly Glu Gly Val Asp Val Val Leu Asn Ser Leu Ser Gly Glu 
1155 1160 1165 

Leu lie Asp Lys Ser Phe Asn Leu Leu Arg Ser His Gly Arg Phe Val 
1170 ~ 1175 1180 

Glu Leu Gly Lys Arg Asp Cys Tyr Ala Asp Asn Gin Leu Gly Leu Arg 
1185 1190 1195 1200 

Pro Phe Leu Arg Asn Leu Ser Phe Ser Leu Val Asp Leu Arg Gly Met 
!»* 12 05 1210 1215 

□ 

Met Leu Glu Arg Pro Ala Arg Val Arg Ala Leu Leu Glu Glu Leu Leu 
j;T 1220 1225 1230 

P Glv Leu He Ala Ala Gly Val Phe Thr Pro Pro Pro He Ala Thr Leu 
'"4 1235 1240 1245 

* t Pro lie Ala Arg Val Ala Asp Ala Phe Arg Ser Met Ala Gin Ala Gin 
* 12 50 ~ 1255 1260 

M His Leu Gly Lys Leu Val Leu Thr Leu Gly Asp Pro Glu Val Gin lie 

|.A 1265 1270 1275 1280 

{ :'t Arg He Pro Thr His Ala Gly Ala Gly Pro Ser Thr Gly Asp Arg Asp 
1:4 1285 1290 1295 

i'l Leu Leu Asp Arg Leu Ala Ser Ala Ala Pro Ala Ala Arg Ala Ala Ala 
1300 1305 1310 

Leu Glu Ala Phe Leu Arg Thr Gin Val Ser Gin Val Leu Arg Thr Pro 
1315 1320 1325 

Glu He Lys Val Gly Ala Glu Ala Leu Phe Thr Arg Leu Gly Met Asp 
1330 * 1335 1340 

Ser Leu Met Ala Val Glu Leu Arg Asn Arg lie Glu Ala Ser Leu Lys 
1345 1350 1355 1360 

Leu Lvs Leu Ser Thr Thr Phe Leu Ser Thr Ser Pro Asn He Ala Leu 
1365 1370 1375 

Leu Ala Gin Asn Leu Leu Asp Ala Leu Ala Thr Ala Leu Ser Leu Glu 
1380 1385 1390 

Arg Val Ala Ala Glu Asn Leu Arg Ala Gly Val Gin Asn Asp Phe Val 
1395 1400 1405 

Ser Ser Gly Ala Asp Gin Asp Trp Glu He He Ala Leu 
1410 1415 1420 

<210> 3 

<211> 1410 

<212> PRT 

<213> Sorangium cellulosum 
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Met°Thr lie Asn Gin Leu Leu Asn Glu Leu Glu His Gin Gly lie Lys 
1 5 10 15 

Leu Ala Ala Asp Gly Glu Arg Leu Gin lie Gin Ala Pro Lys Asn Ala 
20 25 30 

Leu Asn Pro Asn Leu Leu Ala Arg He Ser Glu His Lys Ser Thr He 
35 40 45 

Leu Thr Met Leu Arg Gin Arg Leu Pro Ala Glu Ser He Val Pro Ala 
50 55 60 

Pro Ala Glu Arg His Ala Pro Phe Pro Leu Thr Asp He Gin Glu Ser 
65 70 75 80 

Tyr Trp Leu Gly Arg Thr Gly Ala Phe Thr Val Pro Ser Gly He His 
85 90 95 

i«* Ala Tyr Arg Glu Tyr Asp Cys Thr Asp Leu Asp Val Pro Arg Leu Ser 

i>-| 100 105 HO 

i; 3 Arg Ala Phe Arg Lys Val Val Ala Arg His Asp Met Leu Arg Ala His 

j.* H5 120 125 

=^ Thr Leu Pro Asp Met Met Gin Val He Glu Pro Lys Val Asp Ala Asp 
130 135 140 

He Glu He He Asp Leu Arg Gly Leu Asp Arg Ser Thr Arg Glu Ala 
145 150 155 160 

!"*' Arg Leu Val Ser Leu Arg Asp Ala Met Ser His Arg He Tyr Asp Thr 
{«* 165 170 175 

Lab 

Ui Glu Arg Pro Pro Leu Tyr His Val Val Ala Val Arg Leu Asp Glu Arg 



180 185 190 

Gin Thr Arg Leu Val Leu Ser He Asp Leu He Asn Val Asp Leu Gly 
195 200 205 

Ser Leu Ser He He Phe Lys Asp Trp Leu Ser Phe Tyr Glu Asp Pro 
210 215 220 

Glu Thr Ser Leu Pro Val Leu Glu Leu Ser Tyr Arg Asp Tyr Val Leu 
225 230 235 240 « 

Ala Leu Glu Ser Arg Lys Lys Ser Glu Ala His Gin Arg Ser Met Asp 
245 250 255 

Tyr Trp Lys Arg Arg He Ala Glu Leu Pro Pro Pro Pro Thr Leu Pro 
260 265 270 

Met Lys Ala Asp Pro Ser Thr Leu Lys Glu He Arg Phe Arg His Thr 
275 280 285 

Glu Gin Trp Leu Pro Ser Asp Ser Trp Gly Arg Leu Lys Arg Arg Val 
290 295 300 

Gly Glu Arg Gly Leu Thr Pro Thr Gly Val He Leu Ala Ala Phe Ser 
305 ~ 310 315 320 

Glu Val He Gly Arg Trp Ser Ala Ser Pro Arg Phe Thr Leu Asn He 
325 330 335 

Thr Leu Phe Asn Arg Leu Pro Val His Pro Arg Val Asn Asp He Thr 
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340 345 350 

Glv Asp Phe Thr Ser Met Val Leu Leu Asp lie Asp Thr Thr Arg Asp 
* 355 360 365 

Lvs Ser Phe Glu Gin Arg Ala Lys Arg lie Gin Glu Gin Leu Trp Glu 
370 375 380 

Ala Met Asp His Cys Asp Val Ser Gly lie Glu Val Gin Arg Glu Ala 
385 ^ 390 395 400 

Ala Arg Val Leu Gly lie Gin Arg Gly Ala Leu Phe Pro Val Val Leu 
405 410 415 

Thr Ser Ala Leu Asn Gin Gin Val Val Gly Val Thr Ser Leu Gin Arg 
420 425 430 

Leu Gly Thr Pro Val Tyr Thr Ser Thr Gin Thr Pro Gin Leu Leu Leu 
435 ^ 440 445 

Asp His Gin Leu Tyr Glu His Asp Gly Asp Leu Val Leu Ala Trp Asp 
450 455 460 

He Val Asp Gly Val Phe Pro Pro Asp Leu Leu Asp Asp Met Leu Glu 
465 470 475 480 

Ala Tyr Val Val Phe Leu Arg Arg Leu Thr Glu Glu Pro Trp Gly Glu 
485 490 49^ 

Gin Val Arg Cys Ser Leu Pro Pro Ala Gin Leu Glu Ala Arg Ala Ser 
500 505 510 

Ala Asn Ala Thr Asn Ala Leu Leu Ser Glu His Thr Leu His Gly Leu 
515 520 525 

Phe Ala Ala Arg Val Glu Gin Leu Pro Met Gin Leu Ala Val Val Ser 
530 535 540 

Ala Arg Lys Thr Leu Thr Tyr Glu Glu Leu Ser Arg Arg Ser Arg Arg 
545 550 555 560 

Leu Gly Ala Arg Leu Arg Glu Gin Gly Ala Arg Pro Asn Thr Leu Val 
565 570 575 

Ala Val Val Met Glu Lys Gly Trp Glu Gin Val Val Ala Val Leu Ala 
580 585 590 

Val Leu Glu Ser Gly Ala Ala Tyr Val Pro He Asp Ala Asp Leu Pro 
595 600 605 

Ala Glu Arg lie. His Tyr Leu Leu Asp His Gly Glu Val Lys Leu Val 
610 615 620 

Leu Thr Gin Pro Trp Leu Asp Gly Lys Leu Ser Trp Pro Pro Gly lie 
625 630 635 640 

Gin Arg Leu Leu Val Ser Glu Ala Gly Val Glu Gly Asp Gly Asp Gin 
645 650 655 

Pro Pro Met Met Pro He Gin Thr Pro Ser Asp Leu Ala Tyr Val He 
660 665 670 

Tvr Thr Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Met He Asp His 
675 680 685 

Arg Gly Ala Val Asn Thr He Leu Asp He Asn Glu Arg Phe Glu He 
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690 695 700 

Gly Pro Gly Asp Arg Val Leu Ala Leu Ser Ser Leu Ser Phe Asp Leu 
705 710 715 720 

Ser Val Tyr Asp Val Phe Gly lie Leu Ala Ala Gly Gly Thr He Val 
725 730 735 

Val Pro Asp Ala Ser Lys Leu Arg Asp Pro Ala His Trp Ala Glu Leu 
740 745 750 

He Glu Arg Glu Lys Val Thr Val Trp Asn Ser Val Pro Ala Leu Met 
755 ~ 760 765 

Ara Met Leu Val Glu His Phe Glu Gly Arg Pro Asp Ser Leu Ala Arg 
770 775 780 

Ser Leu Arg Leu Ser Leu Leu Ser Gly Asp Trp He Pro Val Gly Leu 
785 790 795 800 

Pro Gly Glu Leu Gin Ala He Arg Pro Gly Val Ser Val He Ser Leu 
805 810 815 

Glv Gly Ala Thr Glu Ala Ser He Trp Ser He Gly Tyr Pro Val Arg 
* 820 825 830 

Asn Val Asp Leu Ser Trp Ala Ser He Pro Tyr Gly Arg Pro Leu Arg 
835 840 845 

Asn Gin Thr Phe His Val Leu Asp Glu Ala Leu Glu Pro Arg Pro Val 
850 855 860 

Trp Val Pro Gly Gin Leu Tyr He Gly Gly Val Gly Leu Ala Leu Gly 
865 870 875 880 

Tyr Trp Arg Asp Glu Glu Lys Thr Arg Lys Ser Phe Leu Val His Pro 
885 890 895 

Glu Thr Gly Glu Arg Leu Tyr Lys Thr Gly Asp Leu Gly Arg Tyr Leu 
900 905 910 

Pro Asp Gly Asn He Glu Phe Met Gly Arg Glu Asp Asn Gin He Lys 
915 920 925 

Leu Arg Gly Tyr Arg Val Glu Leu Gly Glu He Glu Glu Thr Leu Lys 
930 935 940 

Ser His Pro Asn Val Arg Asp Ala Val He Val Pro Val Gly Asn Asp 
945 950 955 960 

Ala Ala Asn Lys Leu Leu Leu Ala Tyr Val Val Pro Glu Gly Thr Arg 
965 970 975 

Arq Arg Ala Ala Glu Gin Asp Ala Ser Leu Lys Thr Glu Arg He Asp 
9 980 985 990 

Ala Arg Ala His Ala Ala Glu Ala Asp Gly Leu Ser Asp Gly Glu Arg 
995 1000 1005 

Val Gin Phe Lys Leu Ala Arg His Gly Leu Arg Arg Asp Leu Asp Gly 
1010 1015 1° 20 

Lys Pro Val Val Asp Leu Thr Gly Gin Asp Pro Arg Glu Ala Gly Leu 
1025 1030 1035 1040 

Asp Val Tyr Ala Arg Arg Arg Ser Val Arg Thr Phe Leu Glu Ala Pro 
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1045 1050 1055 

He Pro Phe Val Glu Phe Gly Arg Phe Leu Ser Cys Leu Ser Ser Val 
1060> 1065 1070 

Glu Pro Asp Gly Ala Thr Leu Pro Lys Phe Arg Tyr Pro Ser Ala Gly 
1075 1080 1085 

Ser Thr Tyr Pro Val Gin Thr Tyr Ala Tyr Val Lys Ser Gly Arg He 
1090 1095 1100 

Glu Gly Val Asp Glu Gly Phe Tyr Tyr Tyr His Pro Phe Glu His Arg 
1105 1110 1115 112° 

Leu Leu Lys Leu Ser Asp His Gly He Glu Arg Gly Ala His Val Arg 
1125 1130 1135 

Gin Asn Phe Asp Val Phe Asp Glu Ala Ala Phe Asn Leu Leu Phe Val 
1140 1145 1150 

Gly Arg He Asp Ala He Glu Ser Leu Tyr Gly Ser Ser Ser Arg Glu 
1155 1160 1165 

Phe Cys Leu Leu Glu Ala Gly Tyr Met Ala Gin Leu Leu Met Glu Gin 
1170 1175 1180 

Ala Pro Ser Cys Asn He Gly Val Cys Pro Val Gly Gin Phe Asn Phe 
1185 1190 1195 1200 

Glu Gin Val Arg Pro Val Leu Asp Leu Arg His Ser Asp Val Tyr Val 
1205 1210 1215 

His Gly Met Leu Gly Gly Arg Val Asp Pro Arg Gin Phe Gin Val Cys 
1220 1225 1230 

Thr Leu Gly Gin Asp Ser Ser Pro Arg Arg Ala Thr Thr Arg Gly Ala 
1235 1240 1245 

Pro Pro Gly Arg Glu Gin His Phe Ala Asp Met Leu Arg Asp Phe Leu 
1250 1255 1260 

Arg Thr Lys Leu Pro Glu Tyr Met Val Pro Thr Val Phe Val Glu Leu 
1265 1270 1275 1280 

Asp Ala Leu Pro Leu Thr Ser Asn Gly Lys Val Asp Arg Lys Ala Leu 
1285 1290 1295 

Arg Glu Arg Lys Asp Thr Ser Ser Pro Arg His Ser Gly His Thr Ala 
1300 1305 1310 

Pro Arg Asp Ala Leu Glu Glu He Leu Val Ala Val Val Arg Glu Val 
1315 1320 1325 

Leu Gly Leu Glu Val Val Gly Leu Gin Gin Ser Phe Val Asp Leu Gly 
1330 1335 1340 

Ala Thr Ser He His He Val Arg Met Arg Ser Leu Leu Gin Lys Arg 
1345 1350 1355 1360 

Leu Asp Arg Glu He Ala He Thr Glu Leu Phe Gin Tyr Pro Asn Leu 
1365 1370 1375 

Gly Ser Leu Ala Ser Gly Leu Arg Arg Asp Ser Arg Asp Leu Asp Gin 
1380 1385 1390 

Arg Pro Asn Met Gin Asp Arg Val Glu Val Arg Arg Lys Gly Arg Arg 
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1395 1400 1405 

Arg Ser 
1410 

<210> 4 
<211> 1832 
<212> PRT 

<213> Sorangium cellulosum 

Met°Glu Glu Gin Glu Ser Ser Ala He Ala Val He Gly Met Ser Gly 
15 10 15 

Arg Phe Pro Gly Ala Arg Asp Leu Asp Glu Phe Trp Arg Asn Leu Arg 
20 25 30 

Asd Gly Thr Glu Ala Val Gin Arg Phe Ser Glu Gin Glu Leu Ala Ala 
35 40 45 

Ser Gly Val Asp Pro Ala Leu Val Leu Asp Pro Ser Tyr Val Arg Ala 
50 55 60 

Gly Ser Val Leu Glu Asp Val Asp Arg Phe Asp Ala Ala Phe Phe Gly 
65 70 75 80 

He Ser Pro Arg Glu Ala Glu Leu Met Asp Pro Gin His Arg lie Phe 
85 90 95 

Met Glu Cys Ala Trp Glu Ala Leu Glu Asn Ala Gly Tyr Asp Pro Thr 
100 105 HO 

Ala Tyr Glu Gly Ser He Gly Val Tyr Ala Gly Ala Asn Met Ser Ser 
115 ' 120 125 

Tyr Leu Thr Ser Asn Leu His Glu His Pro Ala Met Met Arg Trp Pro 
130 135 140 

Gly Trp Phe Gin Thr Leu He Gly Asn Asp Lys Asp Tyr Leu Ala Thr 
145 150 155 160 

His Val Ser Tyr Arg Leu Asn Leu Arg Gly Pro Ser He Ser Val Gin 
165 170 175 

Thr Ala Cys Ser Thr Ser Leu Val Ala Val His Leu Ala Cys Met Ser 
180 185 . 190 

Leu Leu Asp Arg Glu Cys Asp Met Ala Leu Ala Gly Gly He Thr Val 
195 200 205 

Arg He Pro His Arg Ala Gly Tyr Val Tyr Ala Glu Gly Gly He Phe 
210 215 220 

Ser Pro Asp Gly His Cys Arg Ala Phe Asp Ala Lys Ala Asn Gly Thr 
225 230 235 240 

He Met Gly Asn Gly Cys Gly Val Val Leu Leu Lys Pro Leu Asp Arg 
245 250 255 

Ala Leu Ser Asp Gly Asp Pro Val Arg Ala Val He Leu Gly Ser Ala 
260 265 270 

Thr Asn Asn Asp Gly Ala Arg Lys He Gly Phe Thr Ala Pro Ser Glu 
275 280 285 
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Val Gly Gin Ala Gin Ala He Met Glu Ala Leu Ala Leu Ala Gly Val 
290 295 300 

Glu Ala Arg Ser He Gin Tyr He Glu Thr His Gly Thr Gly Thr Leu 
305 ~ 310 315 320 

Leu Gly Asp Ala He Glu Thr Ala Ala Leu Arg Arg Val Phe Gly Arg 
325 330 335 

Asp Ala Ser Ala Arg Arg Ser Cys Ala He Gly Ser Val Lys Thr Gly 
340 345 350 

He Gly His Leu Glu Ser Ala Ala Gly He Ala Gly Leu He Lys Thr 
355 360 365 

Val Leu Ala Leu Glu His Arg Gin Leu Pro Pro Ser Leu Asn Phe Glu 
370 375 380 

Ser Pro Asn Pro Ser He Asp Phe Ala Ser Ser Pro Phe Tyr Val Asn 
385 '390 395 400 

Thr Ser Leu Lys Asp Trp Asn Thr Gly Ser Thr Pro Arg Arg Ala Gly 
405 410 415 

Val Ser Ser Phe Gly He Gly Gly Thr Asn Ala His Val Val Leu Glu 
420 425 430 

Glu Ala Pro Ala Ala Lys Leu Pro Ala Ala Ala Pro Ala Arg Ser Ala 
435 440 445 

Glu Leu Phe Val Val Ser Ala Lys Ser Ala Ala Ala Leu Asp Ala Ala 
450 455 460 

Ala Ala Arg Leu Arg Asp His Leu Gin Ala His Gin Gly He Ser Leu 
465 470 475 480 

Glv Asp Val Ala Phe Ser Leu Ala Thr Thr Arg Ser Pro Met Glu His 
485 490 495 

Arg Leu Ala Met Ala Ala Pro Ser Arg Glu Ala Leu Arg Glu Gly Leu 
500 505 510 

Asp Ala Ala Ala Arg Gly Gin Thr Pro Pro Gly Ala Val Arg Gly Arg 
515 520 525 

Cys Ser Pro Gly Asn Val Pro Lys Val Val Phe Val Phe Pro Gly Gin , 
530 535 540 

Gly Ser Gin Trp Val Gly Met Gly Arg Gin Leu Leu Ala Glu Glu Pro 
545 550 555 560 

Val Phe His Ala Ala Leu Ser Ala Cys Asp Arg Ala He Gin Ala Glu 
565 570 575 

Ala Gly Trp Ser Leu Leu Ala Glu Leu Ala Ala Asp Glu Gly Ser Ser 
580 585 590 

Gin Leu Glu Arg He Asp Val Val Gin Pro Val Leu Phe Ala Leu Ala 
595 600 605 

Val Ala Phe Ala Ala Leu Trp Arg Ser Trp Gly Val Ala Pro Asp Val 
610 615 620 

Val He Gly His Ser Met Gly Glu Val Ala Ala Ala His Val Ala Gly 
625 630 635 640 
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Ala Leu Ser Leu Glu Asp Ala Val Ala lie lie Cys Arg Arg Ser Arg 
645 ' 650 655 

Leu Leu Arg Arg He Ser Gly Gin Gly Glu Met Ala Val Thr Glu Leu 
660 665 670 

Ser Leu Ala Glu Ala Glu Ala Ala Leu Arg Gly Tyr Glu Asp Arg Val 
675 680 685 

Ser Val Ala Val Ser Asn Ser Pro Arg Ser Thr Val Leu Ser Gly Glu 
690 695 700 

Pro Ala Ala He Gly Glu Val Leu Ser Ser Leu Asn Ala Lys Gly Val 
705 710 715 720 

Phe Cys Arg Arg Val Lys Val Asp Val Ala Ser His Ser Pro Gin Val 
725 730 735 

Asp Pro Leu Arg Glu Asp Leu Leu Ala Ala Leu Gly Gly Leu Arg Pro 
□ 740 ~ 745 . 750 

i : l 

i.I Gly Ala Ala Ala Val Pro Met Arg Ser Thr Val Thr Gly Ala Met Val 
> 755 760 765 

Ala Gly Pro Glu Leu Gly Ala Asn Tyr Trp Met Asn Asn Leu Arg Gin 

j.£ 770 775 780 

4 Pro Val Arg Phe Ala Glu Val Val Gin Ala Gin Leu Gin Gly Gly His 
785 ^ 790 795 800 

l-.jk . 

i.£ Gly Leu Phe Val Glu Met Ser Pro His Pro He Leu Thr Thr Ser Val 
l. A 805 810 815 

W giu Glu Met Arg Arg Ala Ala Gin Arg Ala Gly Ala Ala Val Gly Ser 
Q 820 825 830 



Leu Arg Arg Gly Gin Asp Glu Arg Pro Ala Met Leu Glu Ala Leu Gly 
835 840 845 

Thr Leu Trp Ala Gin Gly Tyr Pro Val Pro Trp Gly Arg Leu Phe Pro 
850 855 860 

Ala Gly Gly Arg Arg Val Pro Leu Pro Thr Tyr Pro Trp Gin Arg Glu ^ 
865 ~ 870 875 880 

Ara Tvx Trp He Glu Ala Pro Ala Lys Ser Ala Ala Gly Asp Arg Arg 
y 885 890 895 

Glv Val Arg Ala Gly Gly His Pro Leu Leu Gly Glu Met Gin Thr Leu 
900 905 910 

Ser Thr Gin Thr Ser Thr Arg Leu Trp Glu Thr Thr Leu Asp Leu Lys 
915 920 925 

Arg Leu Pro Trp Leu Gly Asp His Arg Val Gin Gly Ala Val Val Phe 
930 935 940 

Pro Gly Ala Ala Tyr Leu Glu Met Ala He Ser Ser Gly Ala Glu Ala 
945 950 955 960 

Leu Gly Asp Gly Pro Leu Gin He Thr Asp Val Val Leu Ala Glu Ala 
* 965 970 975 

Leu Ala Phe Ala Gly Asp Ala Ala Val Leu Val Gin Val Val Thr Thr 
980 985 990 
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Glu Gin Pro Ser Gly Arg Leu Gin Phe Gin He Ala Ser Arg Ala Pro 
995 1000 1005 

Gly Ala Gly His Ala Ser Phe Arg Val His Ala Arg Gly Ala Leu Leu 
1010 1015 1020 

Arg Val Glu Arg Thr Glu Val Pro Ala Gly Leu Thr Leu Ser Ala Val 
1025 1030 1035 1040 

Arg Ala Arg Leu Gin Ala Ser He Pro Ala Ala Ala Thr Tyr Ala Glu 
1045 1050 1055 

Leu Thr Glu Met Gly Leu Gin Tyr Gly Pro Ala Phe Gin Gly He Ala 
1060 1065 1070 

Glu Leu Trp Arg Gly Glu Gly Glu Ala Leu Gly Arg Val Arg Leu Pro 
10 75 1080 1085 

Asd Ala Ala Gly Ser Ala Ala Glu Tyr Arg Leu His Pro Ala Leu Leu 
1090 1095 HOO 

Asp Ala Cys Phe Gin He Val Gly Ser Leu Phe Ala Arg Ser Gly Glu 
H05 1110 1H5 112° 

Ala Thr Pro Trp Val Pro Val Glu Leu Gly Ser Leu Arg Leu Leu Gin 
1125 1130 113b 

Arg Pro Ser Gly Glu Leu Trp Cys His Ala Arg Val Val Asn His Gly 
u H40 1145 H50 

His Gin Thr Pro Asp Arg Gin Gly Ala Asp Phe Trp Val Val Asp Ser 
1155 1160 H 65 

Ser Gly Ala Val" Val Ala Glu Val Cys Gly Leu Val Ala Gin Arg Leu 
1170 H75 H80 

Pro Gly Gly Val Arg Arg Arg Glu Glu Asp Asp Trp Phe Leu Glu Leu 
H85 H90 H95 1200 

Glu Trp Glu Pro Ala Ala Val Gly Thr Ala Lys Val Asn Ala Gly Arg 
1205 1210 1219 

Trp Leu Leu Leu Gly Gly Gly Gly Gly Leu Gly Ala Ala Leu Arg Ala 
' 1220 1225 1230 

Met Leu Glu Ala Gly Gly His Ala Val Val His Ala Ala Glu Asn Asn , 
1235 1240 1245 

Thr Ser Ala Ala Gly Val Arg Ala Leu Leu Ala Lys Ala Phe Asp Gly 
1250 1255 1260 

Gin Ala Pro Thr Ala Val Val His Leu Gly Ser Leu Asp Gly Gly Gly 
1265 1270 1275 1280 

Glu Leu Asp Pro Gly Leu Gly Ala Gin Gly Ala Leu Asp Ala Pro Arg 
1285 1290 iws 

Ser Ala Asp Val Ser Pro Asp Ala Leu Asp Pro Ala Leu Val Arg Gly 
1300 I 305 I 310 

Cys Asp Ser Val Leu Trp Thr Val Gin Ala Leu Ala Gly Met Gly Phe 
1315 1320 1325 

Arg Asp Ala Pro Arg Leu Trp Leu Leu Thr Arg Gly Ala Gin Ala Val 
1330 1335 1340 
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Gly Ala Gly Asp Val Ser Val Thr Gin Ala Pro Leu Leu Gly Leu Gly 
1345 1350 1355 1360 

Ara Val lie Ala Met Glu His Ala Asp Leu Arg Cys Ala Arg Val Asp 
1365 1370 1375 

Leu Asp Pro Ala Arg Pro Glu Gly Glu Leu Ala Ala Leu Leu Ala Glu 
1380 1385 1390 

Leu Leu Ala Asp Asp Ala Glu Ala Glu Val Ala Leu Arg Gly Gly Glu 
1395 1400 1405 

Arg Cys Val Ala Arg lie Val Arg Arg Gin Pro Glu Thr Arg Pro Arg 
1410 1415 1420 

Glv Arg He Glu Ser Cys Val Pro Thr Asp Val Thr He Arg Ala Asp 
1425 1430 1435 1440 

Ser Thr Tyr Leu Val Thr Gly Gly Leu Gly Gly Leu Gly Leu Ser Val 
1445 1450 1455 

Ala Gly Trp Leu Ala Glu Arg Gly Ala Gly His Leu Val Leu Val Gly 
1460 1465 1470 

Arg Ser Gly Ala Ala Ser Val Glu Gin Arg Ala Ala Val Ala Ala Leu 
1475 1480 1485 

Glu Ala Arg Gly Ala Arg Val Thr Val Ala Lys Ala Asp Val Ala Asp 
1490 1495 1500 

Ara Ala Gin Leu Glu Arg He Leu Arg Glu Val Thr Thr -Ser Gly Met 
1505 1510 1515 1520 

Pro Leu Arg Gly Val Val His Ala Ala Gly He Leu Asp Asp Gly Leu 
1525 1530 1535 

Leu Met Gin Gin Thr Pro Ala Arg Phe Arg Lys Val Met Ala Pro Lys 
1540 1545 1550 

Val Gin Gly Ala Leu His Leu His Ala Leu Thr Arg Glu Ala Pro Leu 
1555 1560 1565 

Ser Phe Phe Val Leu Tyr Ala Ser Gly Val Gly Leu Leu Gly Ser Pro 
1570 1575 1580 

Glv Gin Gly Asn Tyr Ala Ala Ala Asn Thr Phe Leu Asp Ala Leu Ala < 
1585 1590 1595 1600 

His His Arg Arg Ala Gin Gly Leu Pro Ala Leu Ser Val Asp Trp Gly 
1605 1610 1615 

Leu Phe Ala Glu Val Gly Met Ala Ala Ala Gin Glu Asp Arg Gly Ala 
1620 1625 1630 

Arg Leu Val Ser Arg Gly Met Arg Ser Leu Thr Pro Asp Glu Gly Leu 
1635 1640 1645 

Ser Ala Leu Ala Arg Leu Leu Glu Ser Gly Arg Ala Gin Val Gly Val 
1650 1655 1660 

Met Pro Val Asn Pro Arg Leu Trp Val Glu Leu Tyr Pro Ala Ala Ala 
1665 1670 1675 1680 

Ser Ser Arg Met Leu Ser Arg Leu Val Thr Ala His Arg Ala Ser Ala 
1685 1690 1695 
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Gly Gly Pro Ala Gly Asp Gly Asp Leu Leu Arg Arg Leu Ala Ala Ala 
1700 . 1705 1710 

Glu Pro Ser Ala Arg Ser Ala Leu Leu Glu Pro Leu Leu Arg Ala Gin 
1715 1720 1725 

lie Ser Gin Val Leu Arg Leu Pro Glu Gly Lys lie Glu Val Asp Ala 
1730 1735 1740 

Pro Leu Thr Ser Leu Gly Met Asn Ser Leu Met Gly Leu Glu Leu Arg 
1745 1750 1755 1760 

Asn Arg lie Glu Ala Met Leu Gly He Thr Val Pro Ala Thr Leu Leu 
1765 1770 1775 

Trp Thr Tyr Pro Thr Val Ala Ala Leu Ser Gly His Leu Ala Arg Glu 
1780 1785 1790 

Ala Cys Glu Ala Ala Pro Val Glu Ser Pro His Thr Thr Ala Asp Ser 
1795 1800 1805 

Ala Val Glu He Glu Glu Met Ser Gin Asp Asp Leu Thr Gin Leu He 
1810 1815 1820 

Ala Ala Lys Phe Lys Ala Leu Thr 
1825 1830 

<210> 5 
<211> 7257 
<212> PRT 

<213> Sorangium cellulosum 
<400> 5 

Met Thr Thr Arg Gly Pro Thr Ala Gin Gin Asn Pro Leu Lys Gin Ala 
1 5 10 15 

Ala He He He Gin Arg Leu Glu Glu Arg Leu Ala Gly Leu Ala Gin 
20 25 30 

Ala Glu Leu Glu Arg Thr Glu Pro He Ala He Val Gly He Gly Cys 
35 40 45 

Arg Phe Pro Gly Gly Ala Asp Ala Pro Glu Ala Phe Trp Glu Leu Leu 
50 55 60 

* 

Asp Ala Glu Arg Asp Ala Val Gin Pro Leu Asp Met Arg Trp Ala Leu 
65 70 75 80 

Val Gly Val Ala Pro Val Glu Ala Val Pro His Trp Ala Gly Leu Leu 
85 90 95 

Thr Glu Pro He Asp Cys Phe Asp Ala Ala Phe Phe Gly He Ser Pro 
100 105 HO 

Arg Glu Ala Arg Ser Leu Asp Pro Gin His Arg Leu Leu Leu Glu Val 
115 ~ 120 125 

Ala Trp Glu Gly Leu Glu Asp Ala Gly He Pro Pro Arg Ser He Asp 
130 135 140 

Gly Ser Arg Thr Gly Val Phe Val Gly Ala Phe Thr Ala Asp Tyr Ala 
145 150 155 160 

Arg Thr Val Ala Arg Leu Pro Arg Glu Glu Arg Asp Ala Tyr Ser Ala 
165 170 175 
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Thr Gly Asn Met .Leu Ser He Ala Ala Gly Arg Leu Ser Tyr Thr Leu 
180 185 190 

Gly Leu Gin Gly Pro Cys Leu Thr Val Asp Thr Ala Cys Ser Ser Ser 
195 200 205 

Leu Val Ala He His Leu Ala Cys Arg Ser Leu Arg Ala Gly Glu Ser 
210 215 220 

Asp Leu Ala Leu Ala Gly Gly Val Ser Ala Leu Leu Ser Pro Asp Met 
225 230 235 240 

Met Glu Ala Ala Ala Arg Thr Gin Ala Leu Ser Pro Asp Gly Arg Cys 
245 250 255 

Ara Thr Phe Asp Ala Ser Ala Asn Gly Phe Val Arg Gly Glu Gly Cys 
260 265 270 

Glv Leu Val Val Leu Lys Arg Leu Ser Asp Ala Gin Arg Asp Gly Asp 
275 280 285 

Arg He Trp Ala Leu He Arg Gly Ser Ala He Asn His Asp Gly Arg 
290 295 300 

Ser Thr Gly Leu Thr Ala Pro Asn Val Leu Ala Gin Glu Thr Val Leu 
305 310 315 320 

Arg Glu Ala Leu Arg Ser Ala His Val Glu Ala Gly Ala Val Asp Tyr 
325 330 335 

Val Glu Thr His Gly Thr Gly Thr Ser Leu Gly Asp Pro He Glu Val 
340 345 350 

Glu Ala Leu Arg Ala Thr Val Gly Pro Ala Arg Ser Asp Gly Thr Arg 
355 " 360 365 

Cys Val Leu Gly Ala Val Lys Thr Asn He Gly His Leu Glu Ala Ala 
370 375 380 

Ala Gly Val Ala Gly Leu He Lys Ala Ala Leu Ser Leu Thr His Glu 
385 390 395 400 

Ara He Pro Arg Asn Leu Asn Phe Arg Thr Leu Asn Pro Arg He Arg 
405 410 415 

Leu Glu Gly Ser Ala Leu Ala Leu Ala Thr Glu Pro Val Pro Trp Pro 
420 425 430 

Ara Thr Asp Arg Pro Arg Phe Ala Gly Val Ser Ser Phe Gly Met Ser 
435 440 445 

Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala Val Glu Leu 
450 455 460 

Trp Pro Ala Ala Pro Glu Arg Ser Ala Glu Leu Leu Val Leu Ser Gly 
465 470 475 480 

Lys Ser Glu Gly Ala Leu Asp Ala Gin Ala Ala Arg Leu Arg Glu His 
485 490 495 

Leu Asp Met His Pro Glu Leu Gly Leu Gly Asp Val Ala Phe Ser Leu 
500 505 510 

Ala Thr Thr Arg Ser Ala Met Ser His Arg Leu Ala Val Ala Val Thr 
515 520 525 



4-30582A 





34 



Ser Arg. Glu Gly Leu Leu Ala Ala Leu Ser Ala Val Ala Gin Gly Gin 
530 535 540 

Thr Pro Ala Gly Ala Ala Arg Cys He Ala Ser Ser Ser Arg Gly Lys 
545 550 555 560 

Leu Ala Phe Leu Phe Thr Gly Gin Gly Ala Gin Thr Pro Gly Met Gly 
565 570 575 

Ara Gly Leu Cys Ala Ala Trp Pro Ala Phe Arg Glu Ala Phe Asp Arg 
9 580 585 590 

Cys Val Ala Leu Phe Asp Arg Glu Leu Asp Arg Pro Leu Arg Glu Val 
595 600 605 

Met Trp Ala Glu Ala Gly Ser Ala Glu Ser Leu Leu Leu Asp Gin Thr 
610 615 620 

Ala Phe Thr Gin Pro Ala Leu Phe Ala Val Glu Tyr Ala Leu Thr Ala 
625 630 635 640 

Leu Trp Arg Ser Trp Gly Val Glu Pro Glu Leu Leu Val Gly His Ser 
645 650 655 

He Gly Glu Leu Val Ala Ala Cys Val Ala Gly Val Phe Ser Leu Glu 
660 665 670 

Aso Glv Val Arg Leu Val Ala Ala Arg Gly Arg Leu Met Gin Gly Leu 
. 675 680 685 

Ser Ala Gly Gly Ala Met Val Ser Leu Gly Ala Pro Glu Ala Glu Val 
690 695 700 

Ala Ala Ala Val Ala Pro His Ala Ala Ser Val Ser He Ala Ala Val 
705 710 715 720 

Asn Gly Pro Glu Gin Val Val He Ala Gly Val Glu Gin Ala Val Gin 
725 730 735 

Ala He Ala Ala Gly Phe Ala Ala Arg Gly Ala Arg Thr Lys Arg Leu 
740 745 750 

His Val Ser His Ala Phe His Ser Pro Leu Met Glu Pro Met Leu Glu 
755 760 765 

Glu Phe Gly Arg Val Ala Ala Ser Val Thr Tyr Arg Arg Pro Ser Val 
770 775 780 

Ser Leu Val Ser Asn Leu Ser Gly Lys Val Val Thr Asp Glu Leu Ser 
785 790 795 800 

Ala Pro Gly Tyr Trp Val Arg His Val Arg Glu Ala Val Arg Phe Ala 
805 810 815 

Asp Gly Val Lys Ala Leu His Glu Ala Gly Ala Gly Thr Phe Val Glu 
820 825 830 

Val Gly Pro Lys Pro Thr Leu Leu Gly Leu Leu Pro Ala Cys Leu Pro 
835 840 845 

Glu Ala Glu Pro Thr Leu Leu Ala Ser Leu Arg Ala Gly Arg Glu Glu 
850 855 860 

Ala Ala Gly Val Leu Glu Ala Leu Gly Arg Leu Trp Ala Ala Gly Gly 
865 870 875 880 
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Ser Val Ser Trp .Pro Gly Val Phe Pro Thr Ala Gly Arg Arg Val Pro 
885 890 895 

Leu Pro Thr Tyr Pro Trp Gin Arg Gin Arg Tyr Trp He Glu Ala Pro 
900 905 910 

Ala Glu Gly Leu Gly Ala Thr Ala Ala Asp Ala Leu Ala Gin Trp Phe 
915 920 925 

Tvr Arg Val Asp Trp Pro Glu Met Pro Arg Ser Ser Val Asp Ser Arg 
930 935 940 

Arg Ala Arg Ser Gly Gly Trp Leu Val Leu Ala Asp Arg Gly Gly Val 
945 950 955 960 

Gly Glu Ala Ala Ala Ala Ala Leu Ser Ser Gin Gly Cys Ser Cys Ala 
965 970 975 

Val Leu His Ala Pro Ala Glu Ala Ser Ala Val Ala Glu Gin Val Thr 
980 985 990 

Gin Ala Leu Gly Gly Arg Asn Asp Trp Gin Gly Val Leu Tyr Leu Trp 
995 1000 1005 

Gly Leu Asp Ala Val Val Glu Ala Gly Ala Ser Ala Glu Glu Val Ala 
1010 1015 1020 

Lys Val Thr His Leu Ala Ala Ala Pro Val Leu Ala Leu He Gin Ala 
1025 1030 1035 1040 

Leu Gly Thr Gly Pro Arg Ser Pro Arg Leu Trp He Val Thr Arg Gly 
1045 1050 1055 

Ala Cys Thr Val Gly Gly Glu Pro Asp Ala Ala Pro Cys Gin Ala Ala 
1060 1065 1070 

Leu Trp Gly Met Gly Arg Val Ala Ala Leu Glu His Pro Gly Ser Trp 
1075 1080 1085 

Gly Gly Leu Val Asp Leu Asp Pro Glu Glu Ser Pro Thr Glu Val Glu 
1090 1095 1100 

Ala Leu Val Ala Glu Leu Leu Ser Pro Asp Ala Glu Asp Gin Leu Ala 
1105 HIO HI 5 1120 

Phe Arg Gin Gly Arg Arg Arg Ala Ala Arg Leu Val Ala Ala Pro Pro 
1125 1130 xijo 

Glu Gly Asn Ala Ala Pro Val Ser Leu Ser Ala Glu GlySer Tyr Leu 



1140 



1145 H50 



Val Thr Gly Gly Leu Gly Ala Leu Gly Leu Leu Val Ala Arg Trp Leu 
1155 1160 1165 

Val Glu Arg Gly Ala Gly His Leu Val Leu He Ser Arg His Gly Leu 
1170 H75 H80 

Pro Asp Arg Glu Glu Trp Gly Arg Asp Gin Pro Pro Glu Val Arg Ala 
1185 H90 H95 1200 

Arg He Ala Ala He Glu Ala Leu Glu Ala Gin Gly Ala Arg Val Thr 
1205 1210 1215 

Val Ala Ala Val Asp Val Ala Asp Ala Glu Gly Met Ala Ala Leu Leu 
1220 1225 .1230 
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Ala Ala Val Glu Pro Pro Leu Arg Gly Val Val His Ala Ala Gly Leu 
1235 1240 1245 

Leu Asp Asp Gly Leu Leu Ala His Gin Asp Ala Gly Arg Leu Ala Arg 
1250 ' 1255 1260 

Val Leu Arg Pro Lys Val Glu Gly Ala Trp Val Leu His Thr Leu Thr 
1265 ~ 1270 1275 1280 

Ara Glu Gin Pro Leu Asp Leu Phe Val Leu Phe Ser Ser Ala Ser Gly 
1285 1290 1295 

Val Phe Gly Ser lie Gly Gin Gly Ser Tyr Ala Ala Gly Asn Ala Phe 
1300 1305 1310 

Leu Asp Ala Leu Ala Asp Leu Arg Arg Thr Gin Gly Leu Ala Ala Leu 
1315 1320 1325 

Ser He Ala Trp Gly Leu Trp Ala Glu Gly Gly Met Gly Ser Gin Ala 
1330 1335 1340 

Gin Arg Arg Glu His Glu Ala Ser Gly He Trp Ala Met Pro Thr Ser 
1345 1350 1355 1360 

Arg Ala Leu Ala Ala Met Glu Trp Leu Leu Gly Thr Arg Ala Thr Gin 
y 1365 1370 1375 

Ara Val Val He Gin Met Asp Trp Ala His Ala Gly Ala Ala Pro Arg 
1380 1385 1390 

Asp Ala Ser Arg Gly Arg Phe Trp Asp Arg Leu Val Thr Ala Thr Lys 
1395 " 1400 1405 

Glu Ala Ser Ser Ser Ala Val Pro Ala Val Glu Arg Trp Arg Asn Ala 
1410 1415 1420 

Ser Val Val Glu Thr Arg Ser Ala Leu Tyr Glu Leu Val Arg Gly Val 
1425 1430 1435 1440 

Val Ala Gly Val Met Gly Phe Thr Asp Gin Gly Thr Leu Asp Val Arg 
. 1445 1450 1455 

Ara Gly Phe Ala Glu Gin Gly Leu Asp Ser Leu Met Ala Val Glu He 
1460 1465 1470 

Ara Lys Arg Leu Gin Gly Glu Leu Gly Met Pro Leu Ser Ala Thr Leu 
1475 ~ 1480 1485 

Ala Phe Asp His Pro Thr Val Glu Arg Leu Val Glu Tyr Leu Leu Ser 
1490 1495 1500 

Gin Ala Leu Glu Leu Gin Asp Arg Thr Asp Val Arg Ser Val Arg Leu 
1505 1510 1515 1520 

Pro Ala Thr Glu Asp Pro He Ala He Val Gly Ala Ala Cys Arg Phe 
1525 1530 1535 

Pro Gly Gly Val Glu Asp Leu Glu Ser Tyr Trp Gin Leu Leu Thr Glu 
1540 " 1545 1550 

Gly Val Val Val Ser Thr Glu Val Pro Ala Asp Arg Trp Asn Gly Ala 
1555 1560 1565 

Asp Gly Arg Val Pro Gly Ser Gly Glu Ala Gin Arg Gin Thr Tyr Val 
1570 1575 1580 
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Pro Arg Gly Gly .Phe Leu Arg Glu Val Glu Thr Phe Asp Ala Ala Phe 
1585 1590 1595 1600 

Phe His He Ser Pro Arg Glu Ala Met Ser Leu Asp Pro Gin Gin Arg 
1605 1610 1615 

Leu Leu Leu Glu Val Ser Trp Glu Ala He Glu Arg Ala Gly Gin Asp 
1620 1625 1630 

Pro Ser Ala Leu Arg Glu Ser Pro Thr Gly Val Phe Val Gly Ala Gly 
1635 1640 1645 

Pro Asn Glu Tyr Ala Glu Arg Val Gin Glu Leu Ala Asp Glu Ala Ala 
1650 1655 1660 

Gly Leu Tyr Ser Gly Thr Gly Asn Met Leu Ser Val Ala Ala Gly Arg 
1665 1670 1675 1680 

Leu Ser Phe Phe Leu Gly Leu His Gly Pro Thr Leu Ala Val Asp Thr 
1685 1690 1695 

Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Gly Cys Gin Ser Leu 
* 1700 1705 1710 

Ara Arg Gly Glu Cys Asp Gin Ala Leu Val Gly Gly Val Asn Met Leu 
1715 1720 1725 

Leu Ser Pro Lys Thr Phe Ala Leu Leu Ser Arg Met His Ala Leu Ser 
1730 1735 1740 

Pro Gly Gly Arg Cys Lys Thr Phe Ser Ala Asp Ala Asp Gly Tyr Ala 
1745 1750 1755 1760 

Arg Ala Glu Gly Cys Ala Val Val Val Leu Lys Arg Leu Ser Asp Ala 
1765 1770 1775 

Gin Arg Asp Arg Asp Pro He Leu Ala Val He Arg Gly Thr Ala He 
1780 1785 1'90 

Asn His Asp Gly Pro Ser Ser Gly Leu Thr Val Pro Ser Gly Pro Ala 
1795 1800 18°5 

Gin Glu Ala Leu Leu Arg Gin Ala Leu Ala His Ala Gly Val Val Pro 
1810 1815 1820 

Ala Asp Val Asp Phe Val Glu Cys His Gly Thr Gly Thr Ala Leu Gly 
1825 1830 1835 1840 

Asp Pro He Glu Val Arg Ala Leu Ser Asp Val Tyr Gly Gin Ala Arg 
1845 1850 1855 

Pro Ala Asp Arg Pro Leu He Leu Gly Ala Ala Lys Ala Asn Leu Gly 
I860 1865 1870 

His Met Glu Pro Ala Ala Gly Leu Ala Gly Leu Leu Lys Ala Val Leu 
1875 1880 1885 

Ala Leu Gly Gin Glu Gin He Pro Ala Gin Pro Glu Leu Gly Glu Leu 
1890 1895 1900 

Asn Pro Leu Leu Pro Trp Glu Ala Leu Pro Val Ala Val Ala Arg Ala 
1905 1910 1915 1920 

Ala Val Pro Trp Pro Arg Thr Asp Arg Pro Arg Phe Ala Gly Val Ser 
1925 1930 193b 
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Ser Phe Gly Met .Ser Gly Thr Asn Ala His Val Val Leu Glu Glu Ala 
1940 1945 1950 

Pro Ala Val Glu Leu Trp Pro Ala Ala Pro Glu Arg Ser Ala Glu Leu 
1955 1960 1965 

Leu Val Leu Ser Gly Lys Ser Glu Gly Ala Leu Asp Ala Gin Ala Ala 
1970 1975 1980 

Arg Leu Arg Glu His Leu Asp Met His Pro Glu Leu Gly Leu Gly Asp 
1985 1990 1995 2000 

Val Ala Phe Ser Leu Ala Thr Thr Arg Ser Ala Met Asn His Arg Leu 
2005 2010 2015 

Ala Val Ala Val Thr Ser Arg Glu Gly Leu Leu Ala Ala Leu Ser Ala 
2020 2025 2030 

Val Ala Gin Gly Gin Thr Pro Pro Gly Ala Ala Arg Cys lie Ala Ser 
2035 * 2040 2045 

Q Ser Ser Arg Gly Lys Leu Ala Phe Leu Phe Thr Gly Gin Gly Ala Gin 

Q 2050 ~ 2055 2060 

V Z Thr Pro Gly Met Gly Arg Gly Leu Cys Ala Ala Trp Pro Ala Phe Arg 

a h 2065 2070 2075 2080 

Glu Ala Phe Asp Arg Cys Val Ala Leu Phe Asp Arg Glu Leu Asp Arg 
2085 2090 2095 

Pro Leu Arg Glu Val Met Trp Ala Glu Pro Gly Ser Ala Glu Ser Leu 
H 2100 2105 2110 

Leu Leu Asp Gin Thr Ala Phe Thr Gin Pro Ala Leu Phe Thr Val Glu 
2115 2120 2125 

id 

i l Tyr Ala Leu Thr Ala Leu Trp Arg Ser Trp Gly Val Glu Pro Glu Leu 
\2 2130 2135 2140 

Val Ala Gly His Ser Ala Gly Glu Leu Val Ala Ala Cys Val Ala Gly 
2145 2150 2155 2160 

Val Phe Ser Leu Glu Asp Gly Val Arg Leu Val Ala Ala Arg Gly Arg 
2165 2170 2175 

Leu Met Gin Gly Leu Ser Ala Gly Gly Ala Met Val Ser Leu Gly Ala 
2180 2185 2190 

Pro Glu Ala Glu Val Ala Ala Ala Val Ala Pro His Ala Ala Ser Val 
2195 2200 2205 

Ser lie Ala Ala Val Asn Gly Pro Glu Gin Val Val lie Ala Gly Val 
2210 2215 2220 

Glu Gin Ala Val Gin Ala lie Ala Ala Gly Phe Ala Ala Arg Gly Ala 
2225 2230 2235 2240 

Arg Thr Lys Arg Leu His Val Ser His Ala Ser His Ser Pro Leu Met 
2245 2250 2255 

Glu Pro Met Leu Glu Glu Phe Gly Arg Val Ala Ala Ser Val Thr Tyr 
2260 2265 2270 

Arg Arg Pro Ser Val Ser Leu Val Ser Asn Leu Ser Gly Lys Val Val 
2275 2280 2285 
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Ala Asp Glu Leu Ser Ala Pro Gly Tyr Trp Val Arg His Val Arg Glu 
2290 2295 2300 

Ala Val Arg Phe Ala Asp Gly Val Lys Ala Leu His Glu Ala Gly Ala 
2305 2310 2315 2320 

Gly Thr Phe Val Glu Val Gly Pro Lys Pro Thr Leu Leu Gly Leu Leu 
2325 2330 2335 

Pro Ala Cys Leu Pro Glu Ala Glu Pro Thr Leu Leu Ala Ser Leu Arg 
2340 2345 2350 

Ala Gly Arg Glu Glu Ala Ala Gly Val Leu Glu Ala Leu Gly Arg Leu 
2355 2360 2365 

Trp Ala Ala Gly Gly Ser Val Ser Trp Pro Gly Val Phe Pro Thr Ala 
2370 2375 2380 

Gly Arg Arg Val Pro Leu Pro Thr Tyr Pro Trp Gin Arg Gin Arg Tyr 
2385 ~ 2390 2395 2400 

Trp Pro Asp lie Glu Pro Asp Ser Arg Arg His Ala Ala Ala Asp Pro 
2405 2410 2415 

Thr Gin Gly Trp Phe Tyr Arg Val Asp Trp Pro Glu lie Pro Arg Ser 
2420 2425 2430 

Leu Gin Lys Ser Glu Glu Ala Ser Arg Gly Ser Trp Leu Val Leu Ala 
2435 2440 2445 

Asp Lys Gly Gly Val Gly Glu Ala Val Ala Ala Ala Leu Ser Thr Arg 
2450 ** 2455 2460 

Gly Leu Pro Cys Val Val Leu His Ala Pro Ala Glu Thr Ser Ala Thr 
2465 2470 2475 2480 

Ala Glu Leu Val Thr Glu Ala Ala Gly Gly Arg Ser Asp Trp Gin Val 
2485 2490 2495 

Val Leu Tyr Leu Trp Gly Leu Asp Ala Val Val Gly Ala Glu Ala Ser 
2500 2505 2510 

lie Asp Glu lie Gly Asp Ala Thr Arg Arg Ala Thr Ala Pro Val Leu 
2515 2520 2525 

Gly Leu Ala Arg Phe Leu Ser Thr Val Ser Cys Ser Pro Arg Leu Trp 
2530 2535 2540 

Val Val Thr Arg Gly Ala Cys He Val Gly Asp Glu Pro Ala He Ala 
2545 2550 2555 2560 

Pro Cys Gin Ala Ala Leu Trp Gly Met Gly Arg Val Ala Ala Leu Glu 
2565 2570 2575 

His Pro Gly Ala Trp Gly Gly Leu Val Asp Leu Asp Pro Arg Ala Ser 
2580 2585 2590 

Pro Pro Gin Ala Ser Pro He Asp Gly Glu Met Leu Val Thr Glu Leu 
2595 2600 2605 

Leu Ser Gin Glu Thr Glu Asp Gin Leu Ala Phe Arg His Gly Arg Arg 
2610 2615 2620 

His Ala Ala Arg Leu Val Ala Ala Pro Pro Gin Gly Gin Ala Ala Pro 



2625 



2630 



2635 



2640 
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Val Ser Leu Ser -Ala Glu Ala Ser Tyr Leu Val Thr Gly Gly Leu Gly 
2645 2650 2655 

Gly Leu Gly Leu lie Val Ala Gin Trp Leu Val Glu Leu Gly Ala Arg 
2660 2665 2670 

His Leu Val Leu Thr Ser Arg Arg Gly Leu Pro Asp Arg Gin Ala Trp 
2675 2680 2685 

Cvs Glu Gin Gin Pro Pro Glu lie Arg Ala Arg lie Ala Ala Val Glu 
2690 2695 2700 

Ala Leu Glu Ala Arg Gly Ala Arg Val Thr Val Ala Ala Val Asp Val 
2705 2710 2715 2720 

Ala Asp Val Glu Pro Met Thr Ala Leu Val Ser Ser Val Glu Pro Pro 
2725 2730 2735 

Leu Arg Gly Val Val His Ala Ala Gly Val Ser Val Met Arg Pro Leu 
2740 2745 2750 

Ala Glu Thr Asp Glu Thr Leu Leu Glu Ser Val Leu Arg Pro Lys Val 
2755 2760 2765 

Ala Gly Ser Trp Leu Leu His Arg Leu Leu His Gly Arg Pro Leu Asp 
2770 2775 2780 

Leu Phe Val Leu Phe Ser Ser Gly Ala Ala Val Trp Gly Ser His Ser 
2785 2790 2795 2800 

Gin Gly Ala Tyr Ala Ala Ala Asn Ala Phe Leu Asp Gly Leu Ala His 
2805 .2810 2815 

Leu Arg Arg Ser Gin Ser Leu Pro Ala Leu Ser Val Ala Trp Gly Leu 
2820 2825 2830 

Tro Ala Glu Gly Gly Met Ala Asp Ala Glu Ala His Ala Arg Leu Ser 
2835 ~ 2840 2845 

Asp He Gly Val Leu Pro Met Ser Thr Ser Ala Ala Leu Ser Ala Leu 
2850 2855 2860 

Gin Arg Leu Val Glu Thr Gly Ala Ala Gin Arg Thr Val Thr Arg Met 
2865 2870 2875 2880 

Asp Trp Ala Arg Phe Ala Pro Val Tyr Thr Ala Arg Gly Arg Arg Asn 
2885 2890 2895 

Leu Leu Ser Ala Leu Val Ala Gly Arg Asp He He Ala Pro Ser Pro 
2900 2905 2910 

Pro Ala Ala Ala Thr Arg Asn Trp Arg Gly Leu Ser Val Ala Glu Ala 
2915 2920 2925 

Arg val Ala Leu His Glu He Val His Gly Ala Val Ala Arg Val Leu 
2930 2935 2940 

Gly Phe Leu Asp Pro Ser Ala Leu Asp Pro Gly Met Gly Phe Asn Glu 
2945 2950 2955 2960 

Gin Gly Leu Asp Ser Leu Met Ala Val Glu He Arg Asn Leu Leu Gin 
2965 2970 2975 

Ala Glu Leu Asp Val Arg Leu Ser Thr Thr Leu Ala Phe Asp His Pro 
2980 2985 2990 
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Thr Val Gin Arg Leu Val Glu His Leu Leu Val Asp Val Leu Lys Leu 
2995 3000 3005 

Glu Asp Arg Ser Asp Thr Gin His Val Arg Ser Leu Ala Ser Asp Glu 
3010 3015 3020 

Pro He Ala He Val Gly Ala Ala Cys Arg Phe Pro Gly Gly Val Glu 
3025 3030 3035 3040 

Asp Leu Glu Ser Tyr Trp Gin Leu Leu Ala Glu Gly Val Val Val Ser 
3045 3050 3055 

Ala Glu Val Pro Ala Asp Arg Trp Asp Ala Ala Asp Trp Tyr Asp Pro 
3060 3065 3070 

Asp Pro Glu He Pro Gly Arg Thr Tyr Val Thr Lys Gly Ala Phe Leu 
3075 3080 3085 

Arg Asp Leu Gin Arg Leu Asp Ala Thr Phe Phe Arg He Ser Pro Arg 
3090 3095 3100 

!3 Glu Ala Met Ser Leu Asp Pro Gin Gin Arg Leu Leu Leu Glu Val Ser 
!'f 310 5 3110 3115 3120 

-C Trp Glu Ala Leu Glu Ser Ala Gly He Ala Pro Asp Thr Leu Arg Asp 
SJ 3125 3130 3135 

!"* ser Pro Thr Gly Val Phe Val Gly Ala Gly Pro Asn Glu Tyr Tyr Thr 
4 3140 3145 3150 

U Gin Arg Leu Arg Gly Phe Thr Asp Gly Ala Ala Gly Leu Tyr Gly Gly 
\ A 3155 3160 3165 

^ Thr Gly Asn Met Leu Ser Val Thr Ala Gly Arg Leu Ser Phe Phe Leu 

l,g 3170 3175 3180 

if Gly Leu His Gly Pro Thr Leu Ala Met Asp Thr Ala Cys Ser Ser Ser 

i-± 33^,5 3190 3195 

Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Leu Gly flu Cys 
3205 3210 jzis 

Asp Gin Ala Leu Val Gly Gly Val Asn Val Leu Leu Ala Pro Glu Thr 
3220 3225 3230 

Phe Val Leu Leu Ser Arg Met Arg Ala Leu Ser Pro Asp Gly Arg Cys 
3235 3240 3245 

Lys Thr Phe Ser Ala Asp Ala Asp Gly Tyr Ala Arg Gly Glu Gly Cys 
3250 3255 3260 

Ala Val Val Val Leu Lys Arg Leu Arg Asp Ala Gin Arg Ala Gly Asp 
3265 3270 3275 3280 

Ser He Leu Ala Leu He Arg Gly Ser Ala Val Asn His Asp Gly Pro 
3285 3290 

Ser Ser Gly Leu Thr Val Pro Asn Gly Pro Ala Gin Gin Ala Leu Leu 
3300 3305 3310 

Arg Gin Ala Leu Ser Gin Ala Gly Val Ser Pro Val Asp Val Asp Phe 
3315 3320 3325 

Val Glu Cys His Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu Val 
3330 3335 3340 
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Gin Ala Leu Ser Glu Val Tyr Gly Pro Gly Arg Ser Gly Asp Arg Pro 
3345 * 3350 3355 3360 

Leu Val Leu Gly Ala Ala Lys Ala Asn Val Ala His Leu Glu Ala Ala 
3365 3370 3375 

Ser Glv Leu Ala Ser Leu Leu Lys Ala Val Leu Ala Leu Arg His Glu 
* 3380 3385 3390 

Gin He Pro Ala Gin Pro Glu Leu Gly Glu Leu Asn Pro His Leu Pro 
3395 3400 3405 

Trp Asn Thr Leu Pro Val Ala Val Pro Arg Lys Ala Val Pro Trp Gly 
3410 3415 3420 

Arg Gly Ala Arg Pro Arg Arg Ala Gly Val Ser Ala Phe Gly Leu Ser 
3425 3430 3435 3440 

Glv Thr Asn Val His Val Val Leu Glu Glu Ala Pro Glu Val Glu Pro 

* 3445 3450 3455 

2 Ala Pro Ala Ala Pro Ala Arg Pro Val Glu Leu Val Val Leu Ser Ala 
J 3460 3465 3470 

C Lys Ser Ala Ala Ala Leu Asp Ala Ala Ala Ala Arg Leu Ser Ala His 
5 3475 3480 3485 

* Leu Ser Ala His Pro Glu Leu Ser Leu Gly Asp Val Ala Phe Ser Leu 
4 3490 3495 . 3500 

" »' Ala Thr Thr Arg Ser Pro Met Glu His Arg Leu Ala He Ala Thr Thr 

I / 3505 3510 3515 3520 

I"* Ser Arg Glu Ala Leu Arg Gly Ala Leu Asp Ala Ala Ala Gin Gin Lys 
;y 3525 3530 3535 

13 Thr Pro Gin Gly Ala Val Arg Gly Lys Ala Val Ser Ser Arg Gly Lys 
!■* 3540 3545 3550 

Leu Ala Phe Leu Phe Thr Gly Gin Gly Ala Gin Met Pro Gly Met Gly 
3555 3560 3565 

Ara Glv Leu Tyr Glu Thr Trp Pro Ala Phe Arg Glu Ala Phe Asp Arg 
3570 3575 3580 

Cys Val Ala Leu Phe Asp Arg Glu He Asp Gin Pro Leu Arg Glu Val 
3585 3590 3595 3600 

Met Trp Ala Ala Pro Gly Leu Ala Gin Ala Ala Arg Leu Asp Gin Thr 
3605 3610 3615 

Ala Tyr Ala Gin Pro Ala Leu Phe Ala Leu Glu Tyr Ala Leu Ala Ala 
y 3620 3625 3630 

Leu Trp Arg Ser Trp Gly Val Glu Pro His Val Leu Leu Gly His Ser 
3635 ~ 3640 3645 

He Gly Glu Leu Val Ala Ala Cys Val Ala Gly Val Phe Ser Leu Glu 
3650 3655 3660 

Asp Ala Val Arg Leu Val Ala Ala Arg Gly Arg Leu Met Gin Ala Leu 
3665 ~ 3670 3675 3680 

Pro Ala Gly Gly Ala Met Val Ala He Ala Ala Ser Glu Ala Glu Val 
3685 3690 3695 
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Ala Ala Ser Val Ala Pro His Ala Ala Thr Val Ser He Ala Ala Val 
3700 * 3705 3710 

Asn Gly Pro Asp Ala Val Val He Ala Gly Ala Glu Val Gin Val Leu 
3715 3720 3725 

Ala Leu Gly Ala Thr Phe Ala Ala Arg Gly He Arg Thr Lys Arg Leu 
3730 3735 3740 

Ala Val Ser His Ala Phe His Ser Pro Leu Met Asp Pro Met Leu Glu 
3745 3750 3755 3760 

Asp Phe Gin Arg Val Ala Ala Thr He Ala Tyr Arg Ala Pro Asp Arg 
* 3765 3770 3775 

Pro Val Val Ser Asn Val Thr Gly His Val Ala Gly Pro Glu He Ala 
3780 3785 3790 

Thr Pro Glu Tyr Trp Val Arg His Val Arg Ser Ala Val Arg Phe Gly 
3795 3800 3805 

Asp Gly Ala Lys Ala Leu His Ala Ala Gly Ala Ala Thr Phe Val Glu 
3810 3815 3820 

Val Glv Pro Lvs Pro Val Leu Leu Gly Leu Leu Pro Ala Cys Leu Gly 
3825 3830 3835 3840 

Glu Ala Asp Ala Val Leu Val Pro Ser Leu Arg Ala Asp Arg Ser Glu 
3845 3850 3855 

Cys Glu Val Val Leu Ala Ala Leu Gly Ala Trp Tyr Ala Trp Gly Gly 
3860 3865 *870 

Ala Leu Asp Trp Lys Gly Val Phe Pro Asp Gly Ala Arg Arg Val Ala 
3875 3880 3885 

Leu Pro Met Tyr Pro Trp Gin Arg Glu Arg His Trp Met Asp Leu Thr 
3890 3895 3900 

Pro Arg Ser Ala Ala Pro Ala Gly He Ala Gly Arg Trp Pro Leu Ala 
3905 3910 3915 3920 

Glv Val Gly Leu Cys Met Pro Gly Ala Val Leu His His Val Leu Ser 
3925 3930 3935 

He Glv Pro Arg His Gin Pro Phe Leu Gly Asp His Leu Val Phe Gly 
3940 3945 3950 

Lvs Val Val Val Pro Gly Ala Phe His Val Ala Val He Leu Ser He 
3955 3960 3965 

Ala Ala Glu Arg Trp Pro Glu Arg Ala He Glu Leu Thr Gly Val Glu 
3970 ~ 3975 3980 

Phe Leu Lys Ala He Ala Met Glu Pro Asp Gin Glu Val Glu Leu His 
3985 3990 3995 4000 

Ala Val Leu Thr Pro Glu Ala Ala Gly Asp Gly Tyr Leu Phe Glu Leu 
4005 4010 4015 

Ala Thr Leu Ala Ala Pro Glu Thr Glu Arg Arg Trp Thr Thr His Ala 
4020 4025 4030 

Arg Gly Arg Val Gin Pro Thr Asp Gly Ala Pro Gly Ala Leu Pro Arg 
. 4035 4040 4045 



4-30582A 



44 



Leu Glu Val Leu Glu Asp Arg Ala He Gin Pro Leu Asp Phe Ala Gly 
4050 4055 4060 

Phe Leu Asp Arg Leu Ser Ala Val Arg He Gly Trp Gly Pro Leu Trp 
4065 4070 4075 4080 

Ara Trp Leu Gin Asp Gly Arg Val Gly Asp Glu Ala Ser Leu Ala Thr 
4085 4090 4095 

Leu Val Pro Thr Tyr Pro Asn Ala His Asp Val Ala Pro Leu His Pro 
4100 4105 4110 

He Leu Leu Asp Asn Gly Phe Ala Val Ser Leu Leu Ser Thr Arg Ser 
4115 4120 4125 

Glu Pro Glu Asp Asp Gly Thr Pro Pro Leu Pro Phe Ala Val Glu Arg 
4130 ~ 4135 4140 

Val Arg Trp Trp Arg Ala Pro Val Gly Arg Val Arg Cys Gly Gly Val 
4145 4150 4155 4160 

Pro Arg Ser Gin Ala Phe Gly Val Ser Ser Phe Val Leu Val Asp Glu 
4165 4170 4175 

Thr Gly Glu Val Val Ala Glu Val Glu Gly Phe Val Cys Arg Arg Ala 
4180 4185 4190 

Pro Arg Glu Val Phe Leu Arg Gin Glu Ser Gly Ala Ser Thr Ala Ala 
4195 4200 4205 

Leu Tyr Arg Leu Asp Trp Pro Glu Ala Pro Leu Pro Asp Ala Pro Ala 
4210 4215 4220 

Glu Arg He Glu Glu Ser Trp Val Val Val Ala Ala Pro Gly Ser Glu 
4225 4230 4235 4240 

Met Ala Ala Ala Leu Ala Thr Arg Leu Asn Arg Cys Val Leu Ala Glu 
4245 4250 4255 

Pro Lys Gly Leu Glu Ala Ala Leu Ala Gly Val Ser Pro Ala Gly Val 
4260 4265 4270 

He Cys Leu Trp Glu Ala Gly Ala His Glu Glu Ala Pro Ala Ala Ala 
4275 4280 4285 

Gin Arg Val Ala Thr Glu Gly Leu Ser Val Val Gin Ala Leu Arg Asp 
4290 4295 4300 

Arg Ala Val Arg Leu Trp Trp Val Thr Met Gly Ala Val Ala Val Glu 
4305 4310 4315 4320 

Ala Gly Glu Arg Val Gin Val Ala Thr Ala Pro Val Trp Gly Leu Gly 
4325 4330 4335 

Arg Thr Val Met Gin Glu Arg Pro Glu Leu Ser Cys Thr Leu Val Asp 
4340 4345 4350 

Leu Glu Pro Glu Ala Asp Ala Ala Arg Ser Ala Asp Val Leu Leu Arg. 
4355 4360 4365 

Glu Leu Gly Arg Ala Asp Asp Glu Thr Gin Val Ala Phe Arg Ser Gly 
4370 4375 4380 

Lys Arg Arg Val Ala Arg Leu Val Lys Ala Thr Thr Pro Glu Gly Leu 
4385 4390 4395 4400 
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Leu Val Pro Asp Ala Glu Ser Tyr Arg Leu Glu Ala Gly Gin Lys Gly 
4405 4410 4415 

Thr Leu Asp Gin Leu Arg Leu Ala Pro Ala Gin Arg Arg Ala Pro Gly 
4420 4425 4430 

Pro Gly Glu Val Glu He Lys Val Thr Ala Ser Gly Leu Asn Phe Arg 
4435 4440 4445 

Thr Val Leu Ala Val Leu Gly Met Tyr Pro Gly Asp Ala Gly Pro Met 
4450 4455 4460 

Glv Gly Asp Cys Ala Gly Val Ala Thr Ala Val Gly Gin Gly Val Arg 
4465 4470 4475 4480 

His Val Ala Val Gly Asp Ala Val Met Thr Leu Gly Thr Leu His Arg 
4485 4490 4495 

Phe Val Thr Val Asp Ala Arg Leu Val Val Arg Gin Pro Ala Gly Leu 
tJU 4500 4505 4510 

^ Thr Pro Ala Gin Ala Ala Thr Val Pro Val Ala Phe Leu Thr Ala Trp 
□ 4515 4520 4525 

I A 

> Leu Ala Leu His Asp Leu Gly Asn Leu Arg Arg Gly Glu Arg Val Leu 
4530 4535 4540 

I :aL He His Ala Ala Ala Gly Gly Val Gly Met Ala Ala Val Gin lie Ala 
%J 4545 4550 4555 4560 

- Arg Trp He Gly Ala Glu Val Phe Ala Thr Ala Ser Pro Ser Lys Trp 
j 1 * * 4565 4570 4575 

U Ala Ala Val Gin Ala Met Gly Val Pro Arg Thr His He Ala Ser Ser 
"j j 4580 4585 4590 

Q Arg Thr Leu Glu Phe Ala Glu Thr Phe Arg Gin Val Thr Gly Gly Arg 
j:£ 4595 4600 4605 

Glv Val Asp Val Val Leu Asn Ala Leu Ala Gly Glu Phe Val Asp Ala 
4610 4615 4620 

Ser Leu Ser Leu Leu Ser Thr Gly Gly Arg Phe Leu Glu Met Gly Lys 
4625 4630 4635 4640 

Thr Asp He Arg Asp Arg Ala Ala Val Ala Ala Ala His Pro Gly Val 
4645 4650 4655 

Ara Tyr Arg Val Phe Asp He Leu Glu Leu Ala Pro Asp Arg Thr Arg 
y 4660 4665 4670 

Glu He Leu Glu Arg Val Val Glu Gly Phe Ala Ala Gly His Leu Arg 
4675 4680 4685 

Ala Leu Pro Val His Ala Phe Ala He Thr Lys Ala Glu Ala Ala Phe 
4690 4695 4700 

Arg Phe Met Ala Gin Ala Arg His Gin Gly Lys Val Val Leu Leu Pro 
4705 4710 4715 4720 

Ala Pro Ser Ala Ala Pro Leu Ala Pro Thr Gly Thr Val Leu Leu Thr 
4725 4730 4735 

Gly Gly Leu Gly Ala Leu Gly Leu His Val Ala Arg Trp Leu Ala Gin 
4740 4745 4750 
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Gin Gly Val Pro His Met Val Leu Thr Gly Arg Arg Gly Leu Asp Thr 
4755 4760 4765 

Pro Gly Ala Ala Lys Ala Val Ala Glu lie Glu Ala Leu Gly Ala Arg 
4770 "* 4775 4780 

Val Thr lie Ala Ala Ser Asp Val Ala Asp Arg Asn Ala Leu Glu Ala 
4785 4790 4795 4800 

Val Leu Gin Ala He Pro Ala Glu Trp Pro Leu Gin Gly Val He His 
4805 4810 4815 

Ala Ala Gly Ala Leu Asp Asp Gly Val Leu Asp Glu Gin Thr Thr Asp 
4820 ~ 4825 4830 

Arg Phe Ser Arg Val Leu Ala Pro Lys Val Thr Gly Ala Trp Asn Leu 
4835 " 4840 4845 

His Glu Leu Thr Ala Gly Asn Asp Leu Ala Phe Phe Val Leu Phe Ser 
4850 4855 4860 

Ser Met Ser Gly Leu Leu Gly Ser Ala Gly Gin Ser Asn Tyr Ala Ala 
4865 4870 4875 4880 

Ala Asn Thr Phe Leu Asp Ala Leu Ala Ala His Arg Arg Ala Glu Gly 
4885 4890 4895 

Leu Ala Ala Gin Ser Leu Ala Trp Gly Pro Trp Ser Asp Gly Gly Met 
4900 4905 4910 

Ala Ala Gly Leu Ser Ala Ala Leu Gin Ala Arg Leu Ala Arg His Gly 
4915 4920 4925 

Met Gly Ala Leu Ser Pro Ala Gin Gly Thr Ala Leu Leu Gly Gin Ala 
4930 4935 4940 

Leu Ala Arg Pro Glu Thr Gin Leu Gly Ala Met Ser Leu Asp Val Arg 
4945 4950 4955 4960 

Ala Ala Ser Gin Ala Ser Gly Ala Ala Val Pro Pro Val Trp Arg Ala 
4965 4970 4975 

Leu Val Arg Ala Glu Ala Arg His Thr Ala Ala Gly Ala Gin Gly Ala 
4980 4985 4990 

Leu Ala Ala Arg Leu Gly Ala Leu Pro Glu Ala Arg Arg Ala Asp Glu 
4995 ~ " 5000 5005 

Val Arg Lys Val Val Gin Ala Glu He Ala Arg Val Leu Ser Trp Ser 
5010 5015 5020 

Ala Ala Ser Ala Val Pro Val Asp Arg Pro Leu Ser Asp Leu Gly Leu 
5025 5030 5035 5040 

Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Val Leu Gly Gin Arg Val 
5045 5050 5055 

Glv Ala Thr Leu Pro Ala Thr Leu Ala Phe Asp His Pro Thr Val Asp 
5060 5065 5070 

Ala Leu Thr Arg Trp Leu Leu Asp Lys Val Leu Ala Val Ala Glu Pro 
5075 ~ 5080 5085 

Ser Val Ser Ser Ala Lys Ser Ser Pro Gin Val Ala Leu Asp Glu Pro 
5090 5,095 5100 
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Ile Ala He He Gly He Gly Cys Arg Phe Pro Gly Gly Val Ala Asp 
5105 * 5110 5115 5120 

Pro Glu Ser Phe Trp Arg Leu Leu Glu Glu Gly Ser Asp Ala Val Val 
5125 5130 5135 

Glu Val Pro His Glu Arg Trp Asp He Asp Ala Phe Tyr Asp Pro Asp 
5140 5145 5150 

Pro Asp Val Arg Gly Lys Met Thr Thr Arg Phe Gly Gly Phe Leu Ser 
5155 ~ ' 5160 5165 

Asp He Asp Arg Phe Asp Pro Ala Phe Phe Gly He Ser Pro Arg Glu 
5170 5175 5180 

Ala Thr Thr Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp 
5185 5190 5195 5200 

Glu Ala Phe Glu Arg Ala Gly He Leu Pro Glu Arg Leu Met Gly Ser 
5205 5210 5215 

Asd Thr Gly Val Phe Val Gly Leu Phe Tyr Gin Glu Tyr Ala Ala Leu 
5220 5225 5230 

Ala Gly Gly He Glu Ala Phe Asp Gly Tyr Leu Gly Thr Gly Thr Thr 
5235 5240 5245 

Ala Ser Val Ala Ser Gly Arg He Ser Tyr Val Leu Gly Leu Lys Gly 
5250 5255 5260 

Pro Ser Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Val 
5265 5270 5275 5280 

His Leu Ala Cys Gin Ala Leu Arg Arg Gly Glu Cys Ser Val Ala Leu 
5285 5290 5295 

Ala Gly Gly Val Ala Leu Met Leu Thr Pro Ala Thr Phe Val Glu Phe 
5300 5305 5310 

Ser Arg Leu Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ser Phe Ser 
5315 5320 5325 

Ala Ala Ala Asp Gly Val Gly Trp Ser Glu Gly Cys Ala Met Leu Leu 
5330 5335 5340 

Leu Lys Pro Leu Arg Asp Ala Gin Arg Asp Gly Asp Pro He Leu Ala 
5345 5350 5355 5360 

Val He Arg Gly Thr Ala Val Asn Gin Asp Gly Arg Ser Asn Gly Leu 
5365 5370 5375 

Thr Ala Pro Asn Gly Ser Ser Gin Gin Glu Val He Arg Arg Ala Leu 
5380 5385 5390 

Glu Gin Ala Gly Leu Ala Pro Ala Asp Val Ser Tyr Val Glu Cys His 
5395 5400 5405 

Gly Thr Gly Thr Thr Leu Gly Asp Pro He Glu Val Gin Ala Leu Gly 
5410 5415 5420 

Ala Val Leu Ala Gin Gly Arg Pro Ser Asp Arg Pro Leu Val He Gly 
5425 5430 5435 5440 

Ser Val Lys Ser Asn He Gly His Thr Gin Ala Ala Ala Gly Val Ala 
5445 5450 5455 
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Gly Val lie Lys Val Ala Leu Ala Leu Glu Arg Gly Leu lie Pro Arg 
5460 " 5465 5470 

Ser Leu His Phe Asp Ala Pro Asn Pro His lie Pro Trp Ser Glu Leu 
5475 5480 5485 

Ala Val Gin Val Ala Ala Lys Pro Val Glu Trp Thr Arg Asn Gly Val 
5490 5495 5500 

Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser Gly Thr Asn Ala 
5505 5510 5515 5520 

His Val Val Leu Glu Glu Ala Pro Ala Ala Ala Phe Ala Pro Ala Ala 
5525 5530 5535 

Ala Arg Ser Ala Glu Leu Phe Val Leu Ser Ala Lys Ser Ala Ala Ala 
5540 5545 5550 

Leu Asp Ala Gin Ala Ala Arg Leu Ser Ala His Val Val Ala His Pro 
5555 5560 5565 

Glu Leu Gly Leu Gly Asp Leu Ala Phe Ser Leu Ala Thr Thr Arg Ser 
5570 ~ 5575 5580 

Pro Met Thr Tyr Arg Leu Ala Val Ala Ala Thr Ser Arg Glu Ala Leu 
5585 5590 5595 5600 

Ser Ala Ala Leu Asp Thr Ala Ala Gin Gly Gin Ala Pro Pro Ala Ala 
5605 5610 5615 

Ala Arg Gly His Ala Ser Thr Gly Ser Ala Pro Lys Val Val Phe Val 
5620 5625 5630 

Phe Pro Gly Gin Gly Ser Gin Trp Leu Gly Met Gly Gin Lys Leu Leu 
5635 5640 5645 

Ser Glu Glu Pro Val Phe Arg Asp Ala Leu Ser Ala Cys Asp Arg Ala 
5650 5655 5660 

He Gin Ala Glu Ala Gly Trp Ser Leu Leu Ala Glu Leu Ala Ala Asp 
5665 5670 5675 5680 

Glu Thr Thr Ser Gin Leu Gly Arg He Asp Val Val Gin Pro Ala Leu 
5685 5690 5695 

Phe Ala He Glu Val Ala Leu Ser Ala Leu Trp Arg Ser Trp Gly Val 
5700 5705 5710 

Glu Pro Asp Ala Val Val Gly His Ser Met Gly Glu Val Ala Ala Ala 
5715 5720 5725 

His Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Val Ala He He Cys 
5730 ~ 5735 5740 

Arg Arg Ser Leu Leu Leu Arg Arg He Ser Gly Gin Gly Glu Met Ala 
5745 5750 5755 5760 

Val Val Glu Leu Ser Leu Ala Glu Ala Glu Ala Ala Leu Leu Gly Tyr 
5765 5770 5775 

Glu Asp Arg Leu Ser Val Ala Val Ser Asn Ser Pro Arg Ser Thr Val 
5780 5785 . 5790 

Leu Ala Gly Glu Pro Ala Ala Leu Ala Glu Val Leu Ala He Leu Ala 
5795 5800 5805 
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Ala Lys Gly Val Phe Cys Arg Arg Val Lys Val Asp Val Ala Ser His 
5810 5815 5820 

Ser Pro Gin lie Asp Pro Leu Arg Asp Glu Leu Leu Ala Ala Leu Gly 
5825 5830 5835 5840 

Glu Leu Glu Pro Arg Gin Ala Thr Val Ser Met Arg Ser Thr Val Thr 
5845 5850 5855 

Ser Thr lie Met Ala Gly Pro Glu Leu Val Ala Ser Tyr Trp Ala Asp 
5860 5865 5870 

Asn Val Arg Gin Pro Val Arg Phe Ala Glu Ala Val Gin Ser Leu Met 
5875 5880 5885 

Glu Asp Gly His Gly Leu Phe Val Glu Met Ser Pro His Pro lie Leu 
5890 5895 5900 

Thr Thr Ser Val Glu Glu lie Arg Arg Ala Thr Lys Arg Glu Gly Val 
5905 5910 5915 5920 

Ala Val Gly Ser Leu Arg Arg Gly Gin Asp Glu Arg Leu Ser Met Leu 
5925 5930 5935 

Glu Ala Leu Gly Ala Leu Trp Val His Gly Gin Ala Val Gly Trp Glu 
5940 5945 5950 

Arg Leu Phe Ser Ala Gly Gly Ala Gly Leu Arg Arg Val Pro Leu Pro 
5955 5960 5965 

Thr Tyr Pro Trp Gin Arg Glu Arg Tyr Trp Val Asp Ala Pro Thr Gly 
5970 5975 5980 

Gly Ala Ala Gly Gly Ser Arg Phe Ala His Ala Gly Ser His Pro Leu 
5985 5990 5995 6000 

Leu Gly Glu Met Gin Thr Leu Ser Thr Gin Arg Ser Thr Arg Val Trp 
6005 6010 6015 

Glu Thr Thr Leu Asp Leu Lys Arg Leu Pro Trp Leu Gly Asp His Arg 
6020 6025 6030 

Val Gin Gly Ala Val Val Phe Pro Gly Ala Ala Tyr Leu Glu Met Ala 
6035 6040 6045 

Leu Ser Ser Gly Ala Glu Ala Leu Gly Asp Gly Pro Leu Gin Val Ser 
6050 6055 6060 

Asd Val Val Leu Ala Glu Ala Leu Ala Phe Ala Asp Asp Thr Pro Ala 
6065 6070 6075 6080 

Ala Val Gin Val Met Ala Thr Glu Glu Arg Pro Gly Arg Leu Gin Phe 
6085 6090 6095 

His Val Ala Ser Arg Val Pro Gly His Gly Gly Ala Ala Phe Arg Ser 
6100 6105 6110 

His Ala Arg Gly Val Leu Arg Gin lie Glu Arg Ala Glu Val Pro Ala 
6115 6120 6125 

Arg Leu Asp Leu Ala Ala Leu Arg Ala Arg Leu Gin Ala Ser Ala Pro 
6130 ~ 6135 6140 

Ala Ala Ala Thr Tyr Ala Ala Leu Ala Glu Met Gly Leu Glu Tyr Gly 



6145 



6150 



6155 



6160 
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Pro Ala Phe Gin <31y Leu Val Glu Leu Trp Arg Gly Glu Gly Glu Ala 
6165 6170 6175 

Leu Gly Arg Val Arg Leu Pro Glu Ala Ala Gly Ser Pro Ala Ala Cys 
6180 6185 6190 

Arg Leu His Pro Ala Leu Leu Asp Ala Cys Phe His Val Ser Ser Ala 
6195 6200 6205 

Phe Ala Asp Arg Gly Glu Ala Thr Pro Trp Val Pro Val Glu lie Gly 
6210 ^ 6215 6220 

Ser Leu Arg Trp Phe Gin Arg Pro Ser Gly Glu Leu Trp Cys His Ala 
6225 6230 6235 6240 

Arg Ser Val Ser His Gly Lys Pro Thr Pro Asp Arg Arg Ser Thr Asp 
6245 6250 6255 

Phe Trp Val Val Asp Ser Thr Gly Ala lie Val Ala Glu lie Ser Gly 
6260 6265 6270 

Leu Val Ala Gin Arg Leu Ala Gly Gly Val Arg Arg Arg Glu Glu Asp 
6275 6280 6285 

Asp Trp Phe Met Glu Pro Ala Trp Glu Pro Thr Ala Val Pro Gly Ser 
6290 6295 6300 

Glu Val Met Ala Gly Arg Trp Leu Leu lie Gly Ser Gly Gly Gly Leu 
6305 6310 6315 6320 

Gly Ala Ala Leu His Ser Ala Leu Thr Glu Ala Gly His Ser Val Val 
6325 6330 6335 

His Ala Thr Gly Arg Gly Thr Ser Ala Ala Gly Leu Gin Ala Leu Leu 
6340 6345 6350 

Thr Ala Ser Phe Asp Gly Gin Ala Pro Thr Ser Val Val His Leu Gly 
6355 6360 6365 

Ser Leu Asp Glu Arg Gly Val Leu Asp Ala Asp Ala Pro Phe Asp Ala 
6370 6375 6380 

Asp Ala Leu Glu Glu Ser Leu Val Arg Gly Cys Asp Ser Val Leu Trp 
6385 6390 6395 6400 

Thr Val Gin Ala Val Ala Gly Ala Gly Phe Arg Asp Pro Pro Arg Leu 
6405 6410 6415 

Trp Leu Val Thr Arg Gly Ala Gin Ala lie Gly Ala Gly Asp Val Ser 
6420 ' 6425 6430 

Val Ala Gin 'Ala Pro Leu Leu Gly Leu Gly Arg Val lie Ala Leu Glu 
6435 6440 6445 

His Ala Glu Leu Arg Cys Ala Arg lie Asp Leu Asp Pro Ala Arg Arg 
6450 6455 6460 

Asp Gly Glu Val Asp Glu Leu Leu Ala Glu Leu Leu Ala Asp Asp Ala 
6465 6470 6475 6480 

Glu Glu Glu Val Ala Phe Arg Gly Gly Glu Arg Arg Val Ala Arg Leu 
6485 6490 6495 

Val Arg Arg Leu Pro Glu Thr Asp Cys Arg Glu Lys He Glu Pro Ala 
6500 6505 6510 
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Glu Gly Arg Pro phe Arg Leu Glu lie Asp Gly Ser Gly Val Leu Asp 
. 6515 6520 6525 

Asp Leu Val Leu Arg Ala Thr Glu Arg Arg Pro Pro Gly Pro Gly Glu 
6530 6535 6540 

Val Glu He Ala Val Glu Ala Ala Gly Leu Asn Phe Leu Asp Val Met 
6545 6550 6555 6560 

Arg Ala Met Gly lie Tyr Pro Gly Pro Gly Asp Gly Pro Val Ala Leu 
6565 6570 6575 

Gly Ala Glu Cys Ser Gly Arg He Val Ala Met Gly Glu Gly Val Glu 
6580 6585 6590 

Ser Leu Arg He Gly Gin Asp Val Val Ala Val Ala Pro Phe Ser Phe 
6595 6600 6605 

Gly Thr His Val Thr He Asp Ala Arg Met Leu Ala Pro Arg Pro Ala 
6610 6615 6620 

Ala Leu Thr Ala Ala Gin Ala Ala Ala Leu Pro Val Ala Phe Met Thr 
6625 6630 6635 6640 

Ala Trp Tyr Gly Leu Val His Leu Gly Arg Leu Arg Ala Gly Glu Arg 
6645 6650 6655 

Val Leu He His Ser Ala Thr Gly Gly Thr Gly Leu Ala Ala Val Gin 
6660 6665 6670 

He Ala Arg His Leu Gly Ala Glu He Phe Ala Thr Ala Gly Thr Pro 
6675 6680 6685 

Glu Lys Arg Ala Trp Leu Arg Glu Gin Gly He Ala His Val Met Asp 
6690 ~ 6695 6700 

Ser Arg Ser Leu Asp Phe Ala Glu Gin Val Leu Ala Ala Thr Lys Gly 
6705 6710 6715 6720 

Glu Gly Val Asp Val Val Leu Asn Ser Leu Ser Gly Ala Ala lie Asp 
6725 6730 6735 

Ala Ser Leu Ser Thr Leu Val Pro Asp Gly Arg Phe lie Glu Leu Gly 
6740 6745 6750 

Lys Thr Asp He Tyr Ala Asp Arg Ser Leu Gly Leu Ala His Phe Arg 
6755 6760 6765 

Lvs Ser Leu Ser Tyr Ser Ala Val Asp Leu Ala Gly Leu Ala Val Arg 
6770 6775 6780 

Aro Pro Glu Arg Val Ala Ala Leu Leu Ala Glu Val Val Asp Leu Leu 
6785 ~ 6790 6795 6800 

Ala Arg Gly Ala Leu Gin Pro Leu Pro Val Glu He Phe Pro Leu Ser 
6805 6810 6815 

Arg Ala Ala Asp Ala Phe Arg Lys Met Ala Gin Ala Gin His Leu Gly 
6820 6825 6830 

Lys Leu Val Leu Ala Leu Glu Asp Pro Asp Val Arg lie Arg Val Pro 
6835 6840 6845 

Gly Glu Ser Gly Val Ala He Arg Ala Asp Gly Ala Tyr Leu Val Thr 
6850 6855 6860 
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Gly Gly Leu Gly Gly Leu Gly Leu Ser Val Ala Gly Trp Leu Ala Glu 
6865 6870 6875 6880 

Gin Gly Ala Gly His Leu Val Leu Val Gly Arg Ser Gly Ala Val Ser 
6885 6890 6895 

Ala Glu Gin Gin Thr Ala Val Ala Ala Leu Glu Ala His Gly Ala Arg 
6900 6905 6910 

Val Thr Val Ala Arg Ala Asp Val Ala Asp Arg Ala Gin Met Glu Arg 
6915 6920 6925 

lie Leu Arg Glu Val Thr Ala Ser Gly Met Pro Leu Arg Gly Val Val 
6930 6935 6940 

His Ala Ala Gly lie Leu Asp Asp Gly Leu Leu Met Gin Gin Thr Pro 
6945 6950 6955 .6960 

Ala Arg Phe Arg Ala Val Met Ala Pro Lys Val Arg Gly Ala Leu His 
6965 6970 6975 

Leu His Ala Leu Thr Arg Glu Ala Pro Leu Ser Phe Phe Val Leu Tyr 
6980 6985 6990 

Ala Ser Gly Ala Gly Leu Leu Gly Ser Pro Gly Gin Gly Asn Tyr Ala 
6995 7000 7005 

Ala Ala Asn Thr Phe Leu Asp Ala Leu Ala His His Arg Arg Ala Gin 
7010 7015 7020 

Gly Leu Pro Ala Leu Ser lie Asp Trp Gly Leu Phe Ala Asp Val Gly 
7025 7030 7035 7040 

Leu Ala Ala Gly Gin Gin Asn Arg Gly Ala Arg Leu Val Thr Arg Gly 
7045 7050 7055 

Thr Arg Ser Leu Thr Pro Asp Glu Gly Leu Trp Ala Leu Glu Arg Leu 
7060 7065 7070 

Leu Asp Gly Asp Arg Thr Gin Ala Gly Val Met Pro Phe Asp Val Arg 
7075 ~ 7080 7085 

Gin Trp Val Glu Phe Tyr Pro Ala Ala Ala Ser Ser Arg Arg Leu Ser 
7090 7095 7100 

Arg Leu Met Thr Ala Arg Arg Val Ala Ser Gly Arg Leu Ala Gly Asp 
7105 7110 7115 7120 

Arg Asp Leu Leu Glu Arg Leu Ala Thr Ala Glu Ala Gly Ala Arg Ala 
7125 7130 7135 

Gly Met Leu Gin Glu Val Val Arg Ala Gin Val Ser Gin Val Leu Arg 
7140 7145 7150 

Leu Ser Glu Gly Lys Leu Asp Val Asp Ala Pro Leu Thr Ser Leu Gly 
7155 7160 7165 

Met Asp Ser Leu Met Gly Leu Glu Leu Arg Asn Arg lie Glu Ala Val 
7170 7175 7180 

Leu Gly lie Thr Met Pro Ala Thr Leu Leu Trp Thr Tyr Pro Thr Val 
7185 7190 7195 7200 

Ala Ala Leu Ser Ala His Leu Ala Ser His Val Val Ser Thr Gly Asp 



7205 



7210 



7215 
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Gly Glu Ser Ala Arg Pro Pro Asp Thr Gly Ser Val Ala Pro Thr Thr 
7220 7225 7230 

His Glu Val Ala Ser Leu Asp Glu Asp Gly Leu Phe Ala Leu He Asp 
7235 7240 7245 

Glu Ser Leu Ala Arg Ala Gly Lys Arg 
7250 7255 

<210> 6 
<211> 3798 
<212> PRT 

<213> Sorangium cellulosum 

Vai°Thr Asp Arg Glu Gly Gin Leu Leu Glu Arg Leu Arg Glu Val Thr 
1 5 10 15 

Leu Ala Leu Arg Lys Thr Leu Asn Glu Arg Asp Thr Leu Glu Leu Glu 
20 25 30 

Lys Thr Glu Pro He Ala He Val Gly He Gly Cys Arg Phe Pro Gly 
35 40 45 

Glv Ala Gly Thr Pro Glu Ala Phe Trp Glu Leu Leu Asp Asp Gly Arg 
50 55 60 

Asp Ala He Arg Pro Leu Glu Glu Arg Trp Ala Leu Val Gly Val Asp 
65 70 75 80 

Pro Gly Asp Asp Val Pro Arg Trp Ala Gly Leu Leu Thr Glu Ala He 
85 90 95 

Asp Gly Phe Asp Ala Ala Phe Phe Gly He Ala Pro Arg Glu Ala Arg 
100 105 110 

Ser Leu Asp Pro Gin His Arg Leu Leu Leu Glu Val Ala Trp Glu Gly 
115 120 125 

Phe Glu Asp Ala Gly He Pro Pro Arg Ser Leu Val Gly Ser Arg Thr 
130 135 140 

Gly Val Phe Val Gly Val Cys Ala Thr Glu Tyr Leu His Ala Ala Val 
145 150 155 160 

Ala His Gin Pro Arg Glu Glu Arg Asp Ala Tyr Ser Thr Thr Gly Asn 
165 170 175 

Met Leu Ser He Ala Ala Gly Arg Leu Ser Tyr Thr Leu Gly Leu Gin 
180 185 190 

Gly Pro Cys Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala 
195 200 205 

He His Leu Ala Cys Arg Ser Leu Arg Ala Arg Glu Ser Asp Leu Ala 
210 215 220 

Leu Ala Gly Gly Val Asn Met Leu Leu Ser Pro Asp Thr Met Arg Ala 
225 230 235 240 

Leu Ala Arg Thr Gin Ala Leu Ser Pro Asn Gly Arg Cys Gin Thr Phe 
245 250 255 

Asp Ala Ser Ala Asn Gly Phe Val Arg Gly Glu Gly Cys Gly Leu He 
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260 265 270 

Val Leu Lys Arg Leu Ser Asp Ala Arg Arg Asp Gly Asp Arg lie Trp 
275 ~ 280 285 

Ala Leu He Arg Gly Ser Ala He Asn Gin Asp Gly Arg Ser Thr Gly 
290 ~ 295 300 

Leu Thr Ala Pro Asn Val Leu Ala Gin Gly Ala Leu Leu Arg Glu Ala 
305 310 315 320 

Leu Arg Asn Ala Gly Val Glu Ala Glu Ala He Gly Tyr He Glu Thr 
325 330 335 

His Gly Ala Ala Thr Ser Leu Gly Asp Pro He Glu He Glu Ala Leu 
340 345 350 

Arg Ala Val Val Gly Pro Ala Arg Ala Asp Gly Ala Arg Cys Val Leu 
355 360 365 

Gly Ala Val Lys Thr Asn Leu Gly His Leu Glu Gly Ala Ala Gly Val 
370 "* 375 380 

Ala Gly Leu He Lys Ala Thr Leu Ser Leu His His Glu Arg He Pro 
385 390 395 400 

Arg Asn Leu Asn Phe Arg Thr Leu Asn Pro Arg He Arg He Glu Gly 
405 410 415 

Thr Ala Leu Ala Leu Ala Thr Glu Pro Val Pro Trp Pro Arg Thr Gly 
420 425 430 

Arg Thr Arg Phe Ala Gly Val Ser Ser Phe Gly Met Ser Gly Thr Asn 
435 440 445 

Ala His Val Val Leu Glu Glu Ala Pro Ala Val Glu Pro Glu Ala Ala 
450 455 460 

Ala Pro Glu Arg Ala Ala Glu Leu Phe Val Leu Ser Ala Lys Ser Ala 
465 470 475 480 

Ala Ala Leu Asp Ala Gin Ala Ala Arg Leu Arg Asp His Leu Glu Lys 
485 490 495 

His Val Glu Leu Gly Leu Gly Asp Val Ala Phe Ser Leu Ala Thr Thr 
500 505 510 

Arg Ser Ala Met Glu His Arg Leu Ala Val Ala Ala . Ser Ser Arg Glu 
515 520 525 

Ala Leu Arg Gly Ala Leu Ser Ala Ala Ala Gin Gly His Thr Pro Pro 
530 " " 535 540 

Gly Ala Val Arg Gly Arg Ala Ser Gly Gly Ser Ala Pro Lys Val Val 
545 550 555 560 

Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Val Gly Met Gly Arg Lys 
565 570 575 

Leu Met Ala Glu Glu Pro Val Phe Arg Ala Ala Leu Glu Gly Cys Asp 
580 585 590 

Arg Ala He Glu Ala Glu Ala Gly Trp Ser Leu Leu Gly Glu Leu Ser 
595 600 605 

Ala Asp Glu Ala Ala Ser Gin Leu Gly Arg He Asp Val Val Gin Pro 
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610 615 620 

Val Leu Phe Ala Met Glu Val Ala Leu Ser Ala Leu Trp Arg Ser Trp 
625 630 635 640 

GIy Val Glu Pro Glu Ala Val Val Gly His Ser Met Gly Glu Val Ala 
645 650 655 

Ala Ala His Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Val Ala lie 
660 665 670 

lie Cys Arg Arg Ser Arg Leu Leu Arg Arg lie Ser Gly Gin Gly Glu 
675 680 685 

Met Ala Leu Val Glu Leu Ser Leu Glu Glu Ala Glu Ala Ala Leu Arg 
690 695 700 

Glv His Glu Gly Arg Leu Ser Val Ala Val Ser Asn Ser Pro Arg Ser 
705 710 715 720 

Thr Val Leu Ala Gly Glu Pro Ala Ala Leu Ser Glu Val Leu Ala Ala 
725 730 735 

Leu Thr Ala Lys Gly Val Phe Trp Arg Gin Val Lys Val Asp Val Ala 
740 745 750 

Ser His Ser Pro Gin Val Asp Pro Leu Arg Glu Glu Leu lie Ala Ala 
755 760 765 

Leu Gly Ala He Arg Pro Arg Ala Ala Ala Val Pro Met Arg Ser Thr 
770 775 780 

Val Thr Gly Gly Val He Ala Gly Pro Glu Leu Gly Ala Ser Tyr Trp 
785 ~ 790 795 800 

Ala Asp Asn Leu Arg Gin Pro Val Arg Phe Ala Ala Ala Ala Gin Ala 
805 810 815 

Leu Leu Glu Gly Gly Pro Ala Leu Phe He Glu Met Ser Pro His Pro 
820 825 830 

He Leu Val Pro Pro Leu Asp Glu He Gin Thr Ala Ala Glu Gin Gly 
835 • 840 845 

Gly Ala Ala Val Gly Ser Leu Arg Arg Gly Gin Asp Glu Arg Ala Thr 
850 855 860 

Leu Leu Glu Ala Leu Gly Thr Leu Trp Ala Ser Gly Tyr Pro Val Ser 
865 870 875 880 

Trp Ala Arg Leu Phe Pro Ala Gly Gly Arg Arg Val Pro Leu Pro Thr 
885 890 895 

Tyr Pro Trp Gin His Glu Arg Cys Trp He Glu Val Glu Pro Asp Ala 
900 905 910 

Arg Arg Leu Ala Ala Ala Asp Pro Thr Lys Asp Trp Phe Tyr Arg Thr 
915 920 925 

Asp Trp Pro Glu Val Pro Arg Ala Ala Pro Lys Ser Glu Thr Ala His 
930 935 940 

Gly Ser Trp Leu Leu Leu Ala Asp Arg Gly Gly Val Gly Glu Ala Val 
945 950 955 960 

Ala Ala Ala Leu Ser Thr Arg Gly Leu Ser Cys Thr Val Leu His Ala 
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965 



970 



975 



Ser Ala Asp Ala Ser Thr Val Ala Glu Gin Val Ser Glu Ala Ala Ser 
980 985 990 

Arg Arg Asn Asp Trp Gin Gly Val Leu Tyr Leu Trp Gly Leu Asp Ala 
995 ~ 1000 1005 

Val Val Asp Ala Gly Ala Ser Ala Asp Glu Val Ser Glu Ala Thr Arg 
1010 1015 1020 

Arg Ala Thr Ala Pro Val Leu Gly Leu Val Arg Phe Leu Ser Ala Ala 
1025 1030 1035 1040 

Pro His Pro Pro Arg Phe Trp Val Val Thr Arg Gly Ala Cys Thr Val 
1045 1050 1055 

Gly Gly Glu Pro Glu Ala Ser Leu Cys Gin Ala Ala Leu Trp Gly Leu 
1060 1065 1070 

Ala Arg Val Ala Ala Leu Glu His Pro Ala Ala Trp Gly Gly Leu Val 
1075 1080 1085 

Asp Leu Asp Pro Gin Lys Ser Pro Thr Glu lie Glu Pro Leu Val Ala 
1090 1095 1100 

Glu Leu Leu Ser Pro Asp Ala Glu Asp Gin Leu Ala Phe Arg Ser Gly 
1105 1110 1115 1120 

Arg Arg His Ala Ala Arg Leu Val Ala Ala Pro Pro Glu Gly Asp Val 
1125 1130 1135 

Ala Pro He Ser Leu Ser Ala Glu Gly Ser Tyr Leu Val Thr Gly Gly 
1140 1145 1150 

Leu Gly Gly Leu Gly Leu Leu Val Ala Arg Trp Leu Val Glu Arg Gly 
1155 1160 1165 

Ala Arg His Leu Val Leu Thr Ser Arg His Gly Leu Pro Glu Arg Gin 
1170 H75 1180 

Ala Ser Gly Gly Glu Gin Pro Pro Glu Ala Arg Ala Arg He Ala Ala 
1185 ~ " H90 . 1195 1200 

Val Glu Gly Leu Glu Ala Gin Gly Ala Arg Val Thr Val Ala Ala Val 
1205 1210 1215 

Asp Val Ala Glu Ala Asp Pro Met Thr Ala Leu Leu Ala Ala He Glu 
1220 1225 1230 

Pro Pro Leu Arg Gly Val Val His Ala Ala Gly Val Phe Pro Val Arg 
1235 ~ 1240 1245 

His Leu Ala Glu Thr Asp Glu Ala Leu Leu Glu Ser Val Leu Arg Pro 
1250 1255 1260 

Lys Val Ala Gly Ser Trp Leu Leu His Arg Leu Leu Arg Asp Arg Pro 
1265 1270 1275 1280 

Leu Asp Leu Phe Val Leu Phe Ser Ser Gly Ala Ala Val Trp Gly Gly 
1285 1290 1295 

Lys Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Phe Leu Asp Gly Leu 
1300 1305 1310 

Ala His His Arg Arg Ala His Ser Leu Pro Ala Leu Ser Leu Ala Trp 



4-30582A 




-57- 



1315 



1320 



1325 



Gly Leu Trp Ala Glu Gly Gly Met Val Asp Ala Lys Ala His Ala Arg 
1330 1335 1340 

Leu Ser Asp lie Gly Val Leu Pro Met Ala Thr Gly Pro Ala Leu Ser 
1345 1350 1355 1360 

Ala Leu Glu Arg Leu Val Asn Thr Ser Ala Val Gin Arg Ser Val Thr 
1365 1370 1375 

Arg Met Asp Trp Ala Arg Phe Ala Pro Val Tyr Ala Ala Arg Gly Arg 
1380 1385 1390 

Arg Asn Leu Leu Ser Ala Leu Val Ala Glu Asp Glu Arg Ala Ala Ser 
1395 1400 1405 

Pro Pro Val Pro Thr Ala Asn Arg lie Trp Arg Gly Leu Ser Val Ala 
1410 1415 1420 

Glu Ser Arg Ser Ala Leu Tyr Glu Leu Val Arg Gly lie Val Ala Arg 
1425 1430 1435 1440 

Val Leu Gly Phe Ser Asp Pro Gly Ala Leu Asp Val Gly Arg Gly Phe 
1445 1450 1455 

Ala Glu Gin Gly Leu Asp Ser Leu Met Ala Leu Glu He Arg Asn Arg 
1460 1465 1470 

Leu Gin Arg Glu Leu Gly Glu Arg Leu Ser Ala Thr Leu Ala Phe Asp 
1475 1480 1485 

His Pro Thr Val Glu Arg Leu Val Ala His Leu Leu Thr Asp Val Leu 
1490 1495 1500 

Lys Leu Glu Asp Arg Ser Asp Thr Arg His He Arg Ser Val Ala Ala 
1505 " 1510 1515 1520 

Asp Asp Asp He Ala He Val Gly Ala Ala Cys Arg Phe Pro Gly Gly 
1525 1530 1535 

Asp Glu Gly Leu Glu Thr Tyr Trp Arg His Leu Ala Glu Gly Met Val 
1540 1545 1550 

Val Ser Thr Glu Val Pro Ala Asp Arg Trp Arg Ala Ala Asp Trp Tyr 
1555 1560 1565 

Asp Pro Asp Pro Glu Val Pro Gly Arg Thr Tyr Val Ala Lys Gly Ala 
1570 1575 1580 

Phe Leu Arg Asp Val Arg Ser Leu Asp Ala Ala Phe Phe Ala He Ser 
1585 ^ 1590 1595 1600 

Pro Arg Glu Ala Met Ser Leu Asp Pro Gin Gin Arg Leu Leu Leu Glu 
1605 1610 1615 

Val Ser Trp Glu Ala He Glu Arg Ala Gly Gin Asp Pro Met Ala Leu 
1620 1625 1630 

Arg Glu Ser Ala Thr Gly Val Phe Val Gly Met He Gly Ser Glu His 
1635 1640 1645 

Ala Glu Arg Val Gin Gly Leu Asp Asp Asp Ala Ala Leu Leu Tyr Gly 
1650 1655 1660 

Thr Thr Gly Asn Leu Leu Ser Val Ala Ala Gly Arg Leu Ser Phe Phe 
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1665 1670 1675 1680 

Leu Gly Leu His Gly Pro Thr Met Thr Val Asp Thr Ala Cys Ser Ser 
1685 1690 1695 

Ser Leu Val Ala Leu His Leu Ala Cys Gin Ser Leu Arg Leu Gly Glu 
1700 1705 1710 

Cys Asp Gin Ala Leu Ala Gly Gly Ser Ser Val Leu Leu Ser Pro Arg 
1715 1720 1725 

Ser Phe Val Ala Ala Ser Arg Met Arg Leu Leu Ser Pro Asp Gly Arg 
1730 1735 1740 

Cvs Lys Thr Phe Ser Ala Ala Ala Asp Gly Phe Ala Arg Ala Glu Gly 
1745 1750 1755 1760 

Cvs Ala Val Val Val Leu Lys Arg Leu Arg Asp Ala Gin Arg Asp Arg 
1765 1770 1775 

Asp Pro He Leu Ala Val Val Arg Ser Thr Ala He Asn His Asp Gly 
1780 1785 1790 

Pro Ser Ser Gly Leu Thr Val Pro Ser Gly Pro Ala Gin Gin Ala Leu 
1795 1800 1805 

Leu Arg Gin Ala Leu Ala Gin Ala Gly Val Ala Pro Ala Glu Val Asp 
1810 1815 1820 

Phe Val Glu Cys His Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu 
1825 1830 1835 1840 

Val Gin Ala Leu Gly Ala Val Tyr Gly Arg Gly Arg Pro Ala Glu Arg 
1845 1850 1855 

Pro Leu Trp Leu Gly Ala Val Lys Ala Asn Leu Gly His Leu Glu Ala 
I860 1865 1870 

Ala Ala Gly Leu Ala Gly Val Leu Lys Val Leu Leu Ala Leu Glu His 
1875 1880 1885 

Glu Gin He Pro Ala Gin Pro Glu Leu Asp Glu Leu Asn Pro His He 
1890 1895 1900 

Pro Trp Ala Glu Leu Pro Val Ala Val Val Arg Arg Ala Val Pro Trp 
1905 1910 1915 1920 

Pro Arg Gly Ala Arg Pro Arg Arg Ala Gly Val Ser Ala Phe Gly Leu 
1925 1930 1935 

Ser Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala Val Glu 
1940 1945 1950 

Pro Val Ala Ala Ala Pro Glu Arg Ala Ala Glu Leu Phe Val Leu Ser 
1955 I960 1965 

' Ala Lys Ser Ala Ala Ala Leu Asp Ala Gin Ala Ala Arg Leu Arg Asp 
1970 1975 1980 

His Leu Glu Lys His Val Glu Leu Gly Leu Gly Asp Val Ala Phe Ser 
1985 1990 1995 2000 

Leu Ala Thr Thr Arg Ser Ala Met Glu His Arg Leu Ala Val Ala Ala 
2005 2010 201§ 

Ser Ser Arg Glu Ala Leu Arg Gly Ala Leu Ser Ala Ala Ala Gin Gly 
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2020 2025 2030 

His Thr Pro Pro Gly Ala Val Arg Gly Arg Ala Ser Gly Gly Ser Ala 
2035 2040 2045 

Pro Lys Val Val Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Val Gly 
2050 2055 2060 

Met Gly Arg Lys Leu Met Ala Glu Glu Pro Val Phe Arg Ala Ala Leu 
2065 2070 2075 2080 

Glu Gly Cys Asp Arg Ala He Glu Ala Glu Ala Gly Trp Ser Leu Leu 
2085 2090 2095 

Glv Glu Leu Ser Ala Asp Glu Ala Ala Ser Gin Leu Gly Arg He Asp 
2100 2105 2110 

Val Val Gin Pro Val Leu Phe Ala Met Glu Val Ala Leu Ser Ala Leu 
2115 2120 2125 

Trp Arg Ser Trp Gly Val Glu Pro Glu Ala Val Val Gly His Ser Met 
q 2130 2135 2140 

° Glv Glu Val Ala Ala Ala His Val Ala Gly Ala Leu Ser Leu Glu Asp 
l'+ 2145 2150 2155 2160 

M Ala Val Ala He He Cys Arg Arg Ser Arg Leu Leu Arg Arg He Ser 
5 ? 2165 2170 2175 

^ Gly Gin Gly Glu Met Ala Leu Val Glu Leu Ser Leu Glu Glu Ala Glu 
2180 2185 2190 

Ala Ala Leu Arg Gly His Glu Gly Arg Leu Ser Val Ala Val Ser Asn 
!■* 2195 2200 2205 

id Ser Pro Arg Ser Thr Val Leu Ala Gly Glu Pro Ala Ala Leu Ser Glu 
2210 2215 2220 



Val Leu Ala Ala Leu Thr Ala Lys Gly Val Phe Trp Arg Gin Val Lys 
2225 2230 2235 2240 

Val Asp Val Ala Ser His Ser Pro Gin Val Asp Pro Leu Arg Glu Glu 
2245 2250 2255 

Leu He Ala Ala Leu Gly Ala He Arg Pro Arg Ala Ala Ala Val Pro 
2260 2265 2270 

Met Arg Ser Thr Val Thr Gly Gly Val He Ala Gly Pro Glu Leu Gly 
2275 2280 2285 

Ala Ser Tyr Trp Ala Asp Asn Leu Arg Gin Pro Val Arg Phe Ala Ala 
2290 ' 2295 2300 

Ala Ala Gin Ala Leu Leu Glu Gly Gly Pro Ala Leu Phe He Glu Met 
2305 2310 2315 12320 

Ser Pro His Pro He Leu Val Pro Pro Leu Asp Glu He Gin Thr Ala 
2325 2330 2335 

Ala Glu Gin Gly Gly Ala Ala Val Gly Ser Leu Arg Arg Gly Gin Asp 
2340 2345 2350 

Glu Arg Ala Thr Leu Leu Glu Ala Leu Gly Thr Leu Trp Ala Ser Gly 
2355 2360 2365 

Tyr Pro Val Ser Trp Ala Arg Leu Phe Pro Ala Gly Gly Arg Arg Val 
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2370 2375 2380 

Pro Leu Pro Thr Tyr Pro Trp Gin His Glu Arg Tyr Trp He Glu Asp 
2385 2390 2395 2400 

Ser Val His Gly Ser Lys Pro Ser Leu Arg Leu Arg Gin Leu Arg Asn 
2405 2410 2415 

Gly Ala Thr Asp His Pro Leu Leu Gly Ala Pro Leu Leu Val Ser Ala 
2420 2425 2430 

Arg Pro Gly Ala His Leu Trp Glu Gin Ala Leu Ser Asp Glu Arg Leu 
2435 2440 2445 

Ser Tyr Leu Ser Glu His Arg Val His Gly Glu Ala Val Leu Pro Ser 
2450 2455 2460 

Ala Ala Tyr Val Glu Met Ala Leu Ala Ala Gly Val Asp Leu Tyr Gly 
2465 2470 2475 2480 

Thr Ala Thr Leu Val Leu Glu Gin Leu Ala Leu Glu Arg Ala Leu Ala 
2485 2490 2495 

Val Pro Ser Glu Gly Gly Arg He Val Gin Val Ala Leu Ser Glu Glu 
2500 2505 2510 

Gly Pro Gly Arg Ala Ser Phe Gin Val Ser Ser Arg Glu Glu Ala Gly 
2515 2520 2525 

Arg Ser Trp Val Arg His Ala Thr Gly His Val Cys Ser Gly Gin Ser 
2530 2535 2540 

Ser Ala Val Gly Ala Leu Lys Glu Ala Pro Trp Glu He Gin Arg Arg 
2545 * 2550 2555 2560 

Cvs Pro Ser Val Leu Ser Ser Glu Ala Leu Tyr Pro Leu Leu Asn Glu 
2565 2570 2575 

His Ala Leu Asp Tyr Gly Pro Cys Phe Gin Gly Val Glu Gin Val Trp 
2580 2585 2590 

Leu Gly Thr Gly Glu Val Leu Gly Arg Val Arg Leu Pro Gly Asp Met 
2595 ** 2600 2605 

Ala Ser Ser Ser Gly Ala Tyr Arg He His Pro Ala Leu Leu Asp Ala 
2610 2615 2620 

Cvs Phe Gin Val Leu Thr Ala Leu Leu Thr Thr Pro Glu Ser He Glu 
2625 2630 2635 2640 

He Arg Arg Arg Leu Thr Asp Leu His Glu Pro Asp Leu Pro Arg Ser 
2645 2650 2655 

Arg Ala Pro Val Asn Gin Ala Val Ser Asp Thr Trp Leu Trp Asp Ala 
2660 2665 2670 

Ala Leu Asp Gly Gly Arg Arg Gin Ser Ala Ser Val Pro Val Asp Leu 
2675 2680 2685 

Val Leu Gly Ser Phe His Ala Lys Trp Glu Val Met Glu Arg Leu Ala 
2690 2695 2700 

Gin Ala Tyr He He Gly Thr Leu Arg He Trp Asn Val Phe Cys Ala 
2705 2710 2715 2720 

Ala Gly Glu Arg His Thr He Asp Glu Leu Leu Val Arg Leu Gin He 
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2725 2730 2735 

Ser Val Val Tyr Arg Lys Val lie Lys Arg Trp Met Glu His Leu Val 
2740 2745 2750 

Ala lie Gly He Leu Val Gly Asp Gly Glu His Phe Val Ser Ser Gin 
2755 2760 2765 

Pro Leu Pro Glu Pro Asp Leu Ala Ala Val Leu Glu Glu Ala Gly Arg 
2770 2775 2780 

Val Phe Ala Asp Leu Pro Val Leu Phe Glu Trp Cys Lys Phe Ala Gly 
2785 2790 2795 2800 

Glu Arg Leu Ala Asp Val Leu Thr Gly Lys Thr Leu Ala Leu Glu He 
2805 2810 2815 

Leu Phe Pro Gly Gly Ser Phe Asp Met Ala Glu Arg He Tyr Arg Asp 
2820 2825 2830 

Ser Pro He Ala Arg Tyr Ser Asn Gly He Val Arg Gly Val Val Glu 
2835 2840 2845 

Ser Ala Ala Arg Val Val Ala Pro Ser Gly Met Phe Ser He Leu Glu 
2850 " 2855 2860 

He Gly Ala Gly Thr Gly Ala Thr Thr Ala Ala Val Leu Pro Val Leu 
2865 2870 2875 2880 

Leu Pro Asp Arg Thr Glu Tyr His Phe Thr Asp Val Ser Pro Leu Phe 
2885 2890 2895 

Leu Ala Arg Ala Glu Gin Arg Phe Arg. Asp Tyr Pro Phe Leu Lys Tyr 
2900 2905 2910 

Gly He Leu Asp Val Asp Gin Glu Pro Ala Gly Gin Gly Tyr Ala His 
2915 2920 2925 

Gin Arg Phe Asp Val He Val Ala Ala Asn Val He His Ala Thr Arg 
2930 • 2935 2940 

Asp He Arg Ala Thr Ala Lys Arg Leu Leu Ser Leu Leu Ala Pro Gly 
2945 ' 2950 2955 2960 

Gly Leu Leu Val Leu Val Glu Gly Thr Gly His Pro He Trp Phe Asp 
2965 2970 2975 

He Thr Thr Gly Leu He Glu Gly Trp Gin Lys Tyr Glu Asp Asp Leu 
2980 2985 2990 

Arg He Asp His Pro Leu Leu Pro Ala Arg Thr Trp Cys Asp Val Leu 
2995 3000 3005 

Arq Arg Val Gly Phe Ala Asp Ala Val Ser Leu Pro Gly Asp Gly Ser 
3010 3015 3020 

Pro Ala Gly He Leu Gly Gin His Val He Leu Ser Arg Ala Pro Gly 
3025 3030 3035 3040 

He Ala Gly Ala Ala Cys Asp Ser Ser Gly Glu Ser Ala Thr Glu Ser 
3045 3050 3055 

Pro Ala Ala Arg Ala Val Arg Gin Glu Trp Ala Asp Gly Ser Ala Asp 
3060 3065 3070 

Val Val His Arg Met Ala Leu Glu Arg Met Tyr Phe His Arg Arg Pro 
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3075 3080 3085 

Gly Arg Gin Val Trp Val His Gly Arg Leu Arg Thr Gly Gly Gly Ala 
3090 3095 3100 

Phe Thr Lys Ala Leu Ala Gly Asp Leu Leu Leu Phe Glu Asp Thr Gly 
3105 3110 3115 3120 

Gin Val Val Ala Glu Val Gin Gly Leu Arg Leu Pro Gin Leu Glu Ala 
3125 3130 3135 

Ser Ala Phe Ala Pro Arg Asp Pro Arg Glu Glu Trp Leu Tyr Ala Leu 
3140 3145 3150 

Glu Trp Gin Arg Lys Asp Pro lie Pro Glu Ala Pro Ala Ala Ala Ser 
3155 ~ 3160 3165 

Ser Ser Ser Ala Gly Ala Trp Leu Val Leu Met Asp Gin Gly Gly Thr 
3170 3175 3180 

Gly Ala Ala Leu Val Ser Leu Leu Glu Gly Arg Gly Glu Ala Cys Val 
3185 3190 3195 3200 

Arg Val He Ala Gly Thr Ala Tyr Ala Cys Leu Ala Pro Gly Leu Tyr 
3205 3210 3215 

Gin Val Asp Pro Ala Gin Pro Asp Gly Phe His Thr Leu Leu Arg Asp 
3220 3225 3230 

Ala Phe Gly Glu Asp Arg He Cys Arg Ala Val Val His Met Trp Ser 
3235 3240 3245 

Leu Asp Ala Thr Ala Ala Gly Glu Arg Ala Thr Ala Glu Ser Leu Gin 
3250 3255 3260 

Ala Asp Gin Leu Leu Gly Ser Leu Ser Ala Leu Ser Leu Val Gin Ala 
3265 3270 3275 3280 

Leu Val Arg Arg Arg Trp Arg Asn Met Pro Arg Leu Trp Leu Leu Thr 
3285 3290 3295 

Arg Ala Val His Ala Val Gly Ala Glu Asp Ala Ala Ala Ser Val Ala 
3300 3305 3310 

Gin Ala Pro Val Trp Gly Leu Gly Arg Thr Leu Ala Leu Glu His Pro 
3315 3320 3325 

Glu Leu Arg Cys Thr Leu Val Asp Val Asn Pro Ala Pro Ser Pro Glu 
3330 3335 3340 

Asp Ala Ala Ala Leu Ala Val Glu Leu Gly Ala Ser Asp Arg Glu Asp 
3345 3350 3355 3360 

Gin Val Ala Leu Arg Ser Asp Gly Arg Tyr Val Ala Arg Leu Val Arg 
3365 3370 3375 

Ser Ser Phe Ser Gly Lys Pro Ala Thr Asp Cys Gly He Arg Ala Asp 
3380 3385 3390 

Gly Ser Tyr Val He Thr Asp Gly Met Gly Arg Val Gly Leu Ser Val 
3395 3400 3405 

Ala Gin Trp Met Val Met Gin Gly Ala Arg His Val Val Leu Val Asp 
3410 3415 3420 

Arg Gly Gly Ala Ser Glu Ala Ser Arg Asp Ala Leu Arg Ser Met Ala 
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3425 3430 3435 3440 

Glu Ala Gly Ala Glu Val Gin He Val Glu Ala Asp Val Ala Arg Arg 
3445 3450 3455 

Asp Asp Val Ala Arg Leu Leu Ser Lys He Glu Pro Ser Met Pro Pro 
3460 ^ 3465 3470 

Leu Arg Gly He Val Tyr Val Asp Gly Thr Phe Gin Gly Asp Ser Ser 
3475 3480 3485 

Met Leu Glu Leu Asp Ala Arg Arg Phe Lys Glu Trp Met Tyr Pro Lys 
3490 3495 3500 

Val Leu Gly Ala Trp Asn Leu His Ala Leu Thr Arg Asp Arg Ser Leu 
3505 3510 3515 3520 

Asp Phe Phe Val Leu Tyr Ser Ser Gly Thr Ser Leu Leu Gly Leu Pro 
3525 3530 3535 

Gly Gin Gly Ser Arg Ala Ala Gly Asp Ala Phe Leu Asp Ala He Ala 
3540 3545 3550 

His His Arg Cys Lys Val Gly Leu Thr Ala Met Ser He Asn Trp Gly 
3555 3560 3565 

Leu Leu Ser Glu Ala Ser Ser Pro Ala Thr Pro Asn Asp Gly Gly Ala 
3570 3575 3580 

Arg Leu Glu Tyr Arg Gly Met Glu Gly Leu Thr Leu Glu Gin Gly Ala 
3585 3590 3595 3600 

Ala Ala Leu Gly Arg Leu Leu Ala Arg Pro Arg Ala Gin Val Gly Val 
3605 3610 3615 

Met Arg Leu Asn Leu Arg Gin Trp Leu Glu Phe Tyr Pro Asn Ala Ala 
3620 ~ 3625 3630 

Arg Leu Ala Leu Trp Ala Glu Leu Leu Lys Glu Arg Asp Arg Ala Asp 
3635 3640 3645 

Arg Gly Ala Ser Asn Ala Ser Asn Leu Arg Glu Ala Leu Gin Ser Ala 
3650 3655 3660 

Arg Pro Glu Asp Arg Gin Leu He Leu Glu Lys His Leu Ser Glu Leu 
3665 3670 3675 3680 

Leu Gly Arg Gly Leu Arg Leu Pro Pro Glu Arg He Glu Arg His Val 
3685 ~ 3690 3695 

Pro Phe Ser Asn Leu Gly Met Asp Ser Leu He Gly Leu Glu Leu Arg 
3700 3705 3710 

Asn Arg He Glu Ala Ala Leu Gly He Thr Val Pro Ala Thr Leu Leu 
3715 3720 3725 

Trp Thr Tyr Pro Asn Val Ala Ala Leu Ser Gly Ser Leu Leu Asp He 
3730 3735 3740 

Leu Phe Pro Asn Ala Gly Ala Thr His Ala Pro Ala Thr Glu Arg Glu 
3745 3750 3755 3760 

Lys Ser Phe Glu Asn Asp Ala Ala Asp Leu Glu Ala Leu Arg Gly Met 
3765 3770 3775 

Thr Asp Glu Gin Lys Asp Ala Leu Leu Ala Glu Lys Leu Ala Gin Leu 
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3780 3785 3790 



Ala Gin He Val Gly Glu 
3795 



<210> 7 
<211> 2439 
<212> PRT 

<213> Sorangium cellulosum 
<400> 7 

Met Ala Thr Thr Asn Ala Gly Lys Leu Glu His Ala Leu Leu Leu Met 
1 5 10 15 

Asp Lys Leu Ala Lys Lys Asn Ala Ser Leu Glu Gin Glu Arg Thr Glu 
20 25 30 

Pro He Ala He Val Gly He Gly Cys Arg Phe Pro Gly Gly Ala Asp 
35 40 45 

Thr Pro Glu Ala Phe Trp Glu Leu Leu Asp Ser Gly Arg Asp Ala Val 
50 55 60 

Gin Pro Leu Asp Arg Arg Trp Ala Leu Val Gly Val His Pro Ser Glu 
65 70 75 80 

Glu Val Pro Arg Trp Ala Gly Leu Leu Thr Glu Ala Val Asp Gly Phe 
85 90 95 

Asp Ala Ala Phe Phe Gly Thr Ser Pro Arg Glu Ala Arg Ser Leu Asp 
100 105 110 

Pro Gin Gin Arg Leu Leu Leu Glu Val Thr Trp Glu Gly Leu Glu Asp 
115 120 125 . 

Ala Gly He Ala Pro Gin Ser Leu Asp Gly Ser Arg Thr Gly Val Phe 
130 135 140 

Leu Gly Ala Cys Ser Ser Asp Tyr Ser His Thr Val Ala Gin Gin Arg 
145 150 155 160 

Arg Glu Glu Gin Asp Ala Tyr Asp He Thr Gly Asn Thr Leu Ser Val 
165 170 175 

Ala Ala Gly Arg Leu Ser Tyr Thr Leu Gly Leu Gin Gly Pro Cys Leu 
180 185 190 

Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala He His Leu Ala 
195 200 205 

Cys Arg Ser Leu Arg Ala Arg Glu Ser Asp Leu Ala Leu Ala Gly Gly 
210 215 220 

Val Asn Met Leu Leu Ser Ser Lys Thr Met He Met Leu Gly Arg He 
225 230 235 ' 240 

Gin Ala Leu Ser Pro Asp Gly His Cys Arg Thr Phe Asp Ala Ser Ala 
245 250 255 

Asn Gly Phe Val Arg Gly Glu Gly Cys Gly Met Val Val Leu Lys Arg 
260 265 270 

Leu Ser Asp Ala Gin Arg His Gly Asp Arg He Trp Ala Leu He Arg 
275 280 285 
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Gly Ser. Ala Met Asn Gin Asp Gly Arg Ser Thr Gly Leu Met Ala Pro 
* 290 295 300 

Asn Val Leu Ala Gin Glu Ala Leu Leu Arg Glu Ala Leu Gin Ser Ala 
305 310 315 320 

Arg Val Asp Ala Gly Ala lie Gly Tyr Val Glu Thr His Gly Thr Gly 
325 330 335 

Thr Ser Leu Gly Asp Pro He Glu Val Glu Ala Leu Arg Ala Val Leu 
340 345 350 

Gly Pro Ala Arg Ala Asp Gly Ser Arg Cys Val Leu Gly Ala Val Lys 
355 360 365 

Thr Asn Leu Gly His Leu Glu Gly Ala Ala Gly Val Ala Gly Leu He 
370 375 380 

Lys Ala Ala Leu Ala Leu His His Glu Leu He Pro Arg Asn Leu His 
385 390 395 400 

Phe His Thr Leu Asn Pro Arg He Arg He Glu Gly Thr Ala Leu Ala 
405 410 415 

Leu Ala Thr Glu Pro Val Pro Trp Pro Arg Ala Gly Arg Pro Arg Phe 
420 425 430 

Ala Gly Val Ser Ala Phe Gly Leu Ser Gly Thr Asn Val His Val Val 
435 440 445 

Leu Glu Glu Ala Pro Ala Thr Val Leu Ala Pro Ala Thr Pro Gly Arg 
450 455 460 

Ser Ala Glu Leu Leu Val Leu Ser Ala Lys Ser Ala Ala Ala Leu Asp 
465 470 475 480 

Ala Gin Ala Ala Arg Leu Ser Ala His He Ala Ala Tyr Pro Glu Gin 
485 490 495 

Gly Leu Gly Asp Val Ala Phe Ser Leu Val Ser Thr Arg Ser Pro Met 
500 505 510 

Glu His Arg Leu Ala Val Ala Ala Thr Ser Arg Glu Ala Leu Arg Ser 
515 520 525 

Ala Leu Glu Val Ala Ala Gin Gly Gin Thr Pro Ala Gly Ala Ala Arg 
530 535 540 

Gly Arg Ala Ala Ser Ser Pro Gly Lys Leu Ala Phe Leu Phe Ala Gly 
545 550 555 560 

Gin Gly Ala Gin Val Pro Gly Met Gly Arg Gly Leu Trp Glu Ala Trp 
565 570 575 

Pro Ala Phe Arg Glu Thr Phe Asp Arg Cys Val Thr Leu Phe Asp Arg 
580 585 590 

Glu Leu His Gin Pro Leu Cys Glu Val Met Trp Ala Glu Pro Gly Ser 
595 600 605 

Ser Arg Ser Ser Leu Leu Asp Gin Thr Ala Phe Thr Gin Pro Ala Leu 
610 615 620 

Phe Ala Leu Glu Tyr Ala Leu Ala Ala Leu Phe Arg Ser Trp Gly Val 
625 630 635 640 
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Glu Pro Glu Leu Val Ala Gly His Ser Leu Gly Glu Leu Val Ala Ala 
J545 650 655 

Cvs Val Ala Gly Val Phe Ser Leu Glu Asp Ala Val Arg Leu Val Val 
* 660 665 670 

Ala Arg Gly Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Ala Met Val 
675 ~ 680 685 

Ser He Ala Ala Pro Glu Ala Asp Val Ala Ala Ala Val Ala Pro His 
690 695 700 

Ala Ala Leu Val Ser He Ala Ala Val Asn Gly Pro Glu Gin Val Val 
705 710 715 720 

He Ala Gly Ala Glu Lys Phe Val Gin Gin He Ala Ala Ala Phe Ala 
725 730 735 

Ala Arg Gly Ala Arg Thr Lys Pro Leu His Val Ser His Ala Phe His 
740 745 750 

Ser Pro Leu Met Asp Pro Met Leu Glu Ala Phe Arg Arg Val Thr Glu 
755 760 765 

Ser Val Thr Tyr Arg Arg Pro Ser He Ala Leu Val Ser Asn Leu Ser 
770 775 780 

Glv Lvs Pro Cys Thr Asp Glu Val Ser Ala Pro Gly Tyr Trp Val Arg 
785 790 795 800 

His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Lys Ala Leu His 
805 810 815 

Ala Ala Gly Ala Gly Leu Phe Val Glu Val Gly Pro Lys Pro Thr Leu 
820 825 830 

Leu Gly Leu Val Pro Ala Cys Leu Pro Asp Ala Arg Pro Val Leu Leu 
835 840 845 

Pro Ala Ser Arg Ala Gly Arg Asp Glu Ala Ala Ser Ala Leu Glu Ala 
850 855 860 

Leu Gly Gly Phe Trp Val Val Gly Gly Ser Val Thr Trp Ser Gly Val 
865 870 875 880 

Phe Pro Ser Gly Gly Arg Arg Val Pro Leu Pro Thr Tyr Pro Trp Gin 
885 890 895 

Ara Glu Arg Tyr Trp He Glu Ala Pro Val Asp Arg Glu Ala Asp Gly 
900 905 910 

Thr Gly Arg Ala Arg Ala Gly Gly His Pro Leu Leu Gly Glu Val Phe 
915 ~ 920 925 

Ser Val Ser Thr His Ala Gly Leu Arg Leu Trp Glu Thr Thr Leu Asp 
930 935 940 

Arg Lys Arg Leu Pro Trp Leu Gly Glu His Arg Ala Gin Gly Glu Val 
945 950 955 960 

Val Phe Pro Gly Ala Gly Tyr Leu Glu Met Ala Leu Ser Ser Gly Ala 
965 970 975 

Glu He Leu Gly Asp Gly Pro He Gin Val Thr Asp Val Val Leu He 
980 985 990 
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Glu Thr Leu Thr Phe Ala Gly Asp Thr Ala Val Pro Val Gin Val Val 
995 1000 1005 

Thr Thr Glu Glu Arg Pro Gly Arg Leu Arg Phe Gin Val Ala Ser Arg 
1010 1015 1020 

Glu Pro Gly Glu Arg Arg Ala Pro Phe Arg lie His Ala Arg Gly Val 
1025 1030 1035 1040 

Leu Arg Arg He Gly Arg Val Glu Thr Pro Ala Arg Ser Asn Leu Ala 
1045 1050 1055 

Ala Leu Arg Ala Arg Leu His Ala Ala Val Pro Ala Ala Ala He Tyr 
1060 ~ 1065 1070 

Gly Ala Leu Ala Glu Met Gly Leu Gin Tyr Gly Pro Ala Leu Arg Gly 
1075 1080 1085 

Leu Ala Glu Leu Trp Arg Gly Glu Gly Glu Ala Leu Gly Arg Val Arg 
1090 1095 1100 

Leu Pro Glu Ala Ala Gly Ser Ala Thr Ala Tyr Gin Leu His Pro Val 
1105 1110 1115 1120 

Leu Leu Asp Ala Cys Val Gin Met He Val Gly Ala Phe Ala Asp Arg 
1125 H30 1135 

Asp Glu Ala Thr Pro Trp Ala Pro Val Glu Val Gly Ser Val Arg Leu 
1140 1145 1150 

Phe Gin Arg Ser Pro Gly Glu Leu Trp Cys His Ala Arg Val Val Ser 
1155 H60 1165 

Asp Gly Gin Gin Ala Ser Ser Arg Trp Ser Ala Asp Phe Glu Leu Met 
1170 1175 1180 

Asp Gly Thr Gly Ala Val Val Ala Glu He Ser Arg Leu Val Val Glu 
1185 H90 1195 1200 

Arg Leu Ala Ser Gly Val Arg Arg Arg Asp Ala Asp Asp Trp Phe Leu 
1205 1210 1215 

Glu Leu Asp Trp Glu Pro Ala Ala Leu Gly Gly Pro Lys He Thr Ala 
1220 1225 1230 

Gly Arg Trp Leu Leu Leu Gly Glu Gly Gly Gly Leu Gly Arg Ser Leu 
1235 1240 1245 

Cys Ser Ala Leu Lys Ala Ala Gly His Val Val Val His Ala Ala Gly 
1250 1255 1260 

Asp Asp Thr Ser Thr Ala Gly Met Arg Ala Leu Leu Ala Asn Ala Phe 
1265 1270 1275 1280 

Asp Gly Gin Ala Pro Thr Ala Val Val His Leu Ser Ser Leu Asp Gly 
1285 1290 1295 

Gly Gly Gin Leu Gly Pro Gly Leu Gly Ala Gin Gly Ala Leu Asp Ala 
1300 1305 1310 

Pro Arg Ser Pro Asp Val Asp Ala Asp Ala Leu Glu Ser Ala Leu Met 
1315 1320 1325 

Arg Gly Cys Asp Ser Val Leu Ser Leu Val Gin Ala Leu Val Gly Met 



1330 
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Asp Leu Arg Asn Ala Pro Arg Leu Trp Leu Leu Thr Arg Gly Ala Gin 
1345 1350 1355 1360 

Ala Ala Ala Ala Gly Asp Val Ser Val Val Gin Ala Pro Leu Leu Gly 
1365 1370 1375 

Leu Gly Arg Thr He Ala Leu Glu His Ala Glu Leu Arg Cys He Ser 
1380 1385 1390 

Val Asp Leu Asp Pro Ala Glu Pro Glu Gly Glu Ala Asp Ala Leu Leu 
1395 1400 1405 

Ala Glu Leu Leu Ala Asp Asp Ala Glu Glu Glu Val Ala Leu Arg Gly 
1410 1415 1420 

Gly Asp Arg Leu Val Ala Arg Leu Val ""^1 Leu Pro ^144^ 

Arg Arg Glu Lys Val Glu Pro Ala Gly Asp Arg Pro Phe Arg Leu Glu 
1445 1450 1455 

He Asn Glu Pro Gly Ala Leu Asp Gin Leu Val Leu Arg Ala Thr Gly 
1460 1465 1470 

Arg Arg Ala Pro Gly Pro Gly Glu Val Glu He Ser Val Glu Ala Ala 
1475 " 1480 1485 

Glv Leu Asp Ser He Asp He Gin Leu Ala Leu Gly Val Ala Pro Asn 
1490 1495 1500 

Asp Leu Pro Gly Glu Glu He Glu Pro Leu Val Leu Gly Ser Glu Cys 
1505 1510 1515 1520 

Ala Gly Arg He Val Ala Val Gly Glu Gly Val Asn Gly Leu Val Val 
1525 1530 1535 

Gly Gin Pro Val He Ala Leu Ala Ala Gly Val Phe Ala Thr His Val 
1540 1545 1550 

Thr Thr Ser Ala Thr Leu Val Leu Pro Arg Pro Leu Gly Leu Ser Ala 
1555 1560 1565 

Thr Glu Ala Ala Ala Met Pro Leu Ala Tyr Leu Thr Ala Trp Tyr Ala 
1570 1575 1580 

Leu Asp Lys Val Ala His Leu Gin Ala Gly Glu Arg Val Leu He His 
1585 1590 1595 1600 

Ala Glu Ala Gly Gly Val Gly Leu Cys Ala Val Arg Trp Ala Gin Arg 
1605 1610 1615 

Val Gly Ala Glu Val Tyr Ala Thr Ala Asp Thr Pro Glu Asn Arg Ala 
1620 1625 1630 

Tyr Leu Glu Ser Leu Gly Val Arg Tyr Val Ser Asp Ser Arg Ser Gly 
1635 * 1640 1645 

Arg Phe Val Thr Asp Val His Ala Trp Thr Asp Gly Glu Gly Val Asp 
1650 1655 1660 

Val Val Leu Asp Ser Leu Ser Gly Glu Arg He Asp Lys Ser Leu Met 
1665 1670 1675 1680 

Val Leu Arg Ala Cys Gly Arg Leu Val Lys Leu Gly Arg Arg Asp Asp 
1685 1690 1695 
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Cys Ala Asp Thr Gin Pro Gly Leu Pro Pro Leu Leu Arg Asn Phe Ser 
1700 1705 1710 

Phe Ser Gin Val Asp Leu Arg Gly Met Met Leu Asp Gin Pro Ala Arg 
1715 1720 1725 

lie Arg Ala Leu Leu Asp Glu Leu Phe Gly Leu Val Ala Ala Gly Ala 
1730 1735 1740 

lie Ser Pro Leu Gly Ser Gly Leu Arg Val Gly Gly Ser Leu Thr Pro 
1745 1750 1755 1760 

Pro Pro Val Glu Thr Phe Pro lie Ser Arg Ala Ala Glu Ala Phe Arg 
1765 1770 1775 

Arg Met Ala Gin Gly Gin His Leu. Gly Lys Leu Val Leu Thr Leu Asp 
1780 1785 1790 

I— Asp Pro Glu Val Arg lie Arg Ala Pro Ala Glu Ser Ser Val Ala Val 
^ ~ 1795 1800 1805 

Arg Ala Asp Gly Thr Tyr Leu Val Thr Gly Gly Leu Gly Gly Leu Gly 
;: C 1810 1815 1820 

M Leu Arg Val Ala Gly Trp Leu Ala Glu Arg Gly Ala Gly Gin Leu Val 
^ 1825 1830 1835 1840 

Leu Val Gly Arg Ser Gly Ala Ala Ser Ala Glu Gin Arg Ala Ala Val 
1845 1850 1855 



Ala Ala Leu Glu Ala His Gly Ala Arg Val Thr Val Ala Lys Ala Asp 
I860 1865 1870 



Val Ala Asp Arg Ser Gin lie Glu Arg Val Leu Arg Glu Val Thr Ala 
3 1875 1880 1885 

•« 

Ser Gly Met Pro Leu Arg Gly Val Val His Ala Ala Gly Leu Val Asp 
1890 1895 1900 

Asp Gly Leu Leu Met Gin Gin Thr Pro Ala Arg Phe Arg Thr Val Met 
1905 1910 1915 1920 

Gly Pro Lys Val Gin Gly Ala Leu His Leu His Thr Leu Thr Arg Glu ' 
1925 1930 1935 

Ala Pro Leu Ser Phe Phe Val Leu Tyr Ala Ser Ala Ala Gly Leu Phe 
1940 1945 1950 

Gly Ser Pro Gly Gin Gly Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp 
1955 I960 1965 

Ala Leu Ser His His Arg Arg Ala Gin Gly Leu Pro Ala Leu Ser lie 
1970 1975 1980 

Asp Trp Gly Met Phe Thr Glu Val Gly Met Ala Val Ala Gin Glu Asn 
1985 1990 1995 2000 

Arg Gly Ala Arg Gin lie Ser Arg Gly Met Arg Gly He Thr Pro Asp 
2005 2010 2015 

Glu Gly Leu Ser Ala Leu Ala Arg Leu Leu Glu Gly Asp Arg Val Gin 
2020 2025 2030 

Thr Gly Val He Pro He Thr Pro Arg Gin Trp Val Glu Phe Tyr Pro 
2035 2040 2045 
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Ala Thr Ala Ala Ser Arg Arg Leu Ser Arg Leu Val Thr Thr Gin Arg 
2050 2055 2060 

Ala Val Ala Asp Arg Thr Ala Gly Asp Arg Asp Leu Leu Glu Gin Leu 
2065 2070 2075 2080 

Ala Ser Ala Glu Pro Ser Ala Arg Ala Gly Leu Leu Gin Asp Val Val 
2085 2090 2095 

Arg Val Gin Val Ser His Val Leu Arg Leu Pro Glu Asp Lys He Glu 
2100 2105 2110 

Val Asp Ala Pro Leu Ser Ser Met Gly Met Asp Ser Leu Met Ser Leu 
2115 2120 2125 

Glu Leu Arg Asn Arg He Glu Ala Ala Leu Gly Val Ala Ala Pro Ala 
2130 ~ 2135 2140 

Ala Leu Gly Trp Thr Tyr Pro Thr Val Ala Ala He Thr Arg Trp Leu 
2145 2150 2155 2160 

Leu Asp Asp Ala Leu Val Val Arg Leu Gly Gly Gly Ser Asp Thr Asp 
2165 2170 2175 

Glu Ser Thr Ala Ser Ala Gly Ser Phe Val His Val Leu Arg Phe Arg 
2180 2185 2190 

Pro Val Val Lys Pro Arg Ala Arg Leu Phe Cys Phe His Gly Ser Gly 
2195 2200 2205 

Gly Ser Pro Glu Gly Phe Arg Ser Trp Ser Glu Lys Ser Glu Trp Ser 
2210 2215 2220 

Asp Leu Glu He Val Ala Met Trp His Asp Arg Ser Leu Ala Ser Glu 
2225 2230 2235 2240 

Asd Ala Pro Gly Lys Lys Tyr Val Gin Glu Ala Ala Ser Leu He Gin 
2245 2250 2255 

His Tyr Ala Asp Ala Pro Phe Ala Leu Val Gly Phe Ser Leu Gly Val 
2260 2265 2270 

Arg Phe Val Met Gly Thr Ala Val Glu Leu Ala Ser Arg Ser Gly Ala 
2275 2280 2285 

Pro Ala Pro Leu Ala Val Phe Thr Leu Gly Gly Ser Leu He Ser Ser 
2290 2295 2300 

Ser Glu He Thr Pro Glu Met Glu Thr Asp He He Ala Lys Leu Phe 
2305 2310 2315 2320 

Phe Arg Asn Ala Ala Gly Phe Val Arg Ser Thr Gin Gin Val Gin Ala 
2325 2330 2335 

Asp Ala Arg Ala Asp Lys Val He Thr Asp Thr Met Val Ala Pro Ala 
2340 2345 2350 

Pro Gly Asp Ser Lys Glu Pro Pro Val Lys He Ala Val Pro lie Val 
2355 ** 2360 2365 

Ala He Ala Gly Ser Asp Asp Val He Val Pro Pro Ser Asp Val Gin 
2370 2375 2380 

Asp Leu Gin Ser Arg Thr Thr Glu Arg Phe Tyr Met His Leu Leu Pro 



2385 



2390 



2395 
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Glv Asp His Glu Phe Leu Val Asp Arg Gly Arg Glu He Met His He 
y 2.405 2410 2415 

Val Asp Ser His Leu Asn Pro Leu Leu Ala Ala Arg Thr Thr Ser Ser 
2420 2425 2430 

Gly Pro Ala Phe Glu Ala Lys 



<210> 8 
<211> 419 
<212> PRT 

<213> Sorangium cellulosum 



Met Thr Gin Glu Gin Ala Asn Gin Ser Glu Thr Lys Pro Ala Phe Asp 
1 5 10 15 . 

Phe Lys Pro Phe Ala Pro Gly Tyr Ala Glu Asp Pro Phe Pro Ala He 
20 25 30 

Glu Arg Leu Arg Glu Ala Thr Pro He Phe Tyr Trp Asp Glu Gly Arg 
35 40 45 

Ser Trp Val Leu Thr Arg Tyr His Asp Val Ser Ala Val Phe Arg Asp 
50 55 60 

Glu Arg Phe Ala Val Ser Arg Glu Glu Trp Glu Ser Ser Ala Glu Tyr 
65 70 75 80 

Ser Ser Ala He Pro Glu Leu Ser Asp Met Lys Lys Tyr Gly Leu Phe 
85 90 95 

Gly Leu Pro Pro Glu Asp His Ala Arg Val Arg Lys Leu Val Asn Pro 
100 105 110 

Ser Phe Thr Ser Arg Ala He Asp Leu Leu Arg Ala Glu He Gin Arg 
115 ~ 120 125 

Thr Val Asp Gin Leu Leu Asp Ala Arg Ser Gly Gin Glu Glu Phe Asp 
130 135 140 

Val Val Arg Asp Tyr Ala Glu Gly He Pro Met Arg Ala He Ser Ala 
145 150 155 160 

Leu Leu Lys Val Pro Ala Glu Cys Asp Glu Lys Phe Arg Arg Phe Gly 
165 170 175 

Ser Ala Thr Ala Arg Ala Leu Gly Val Gly Leu Val Pro Gin Val Asp 
180 185 190 

Glu Glu Thr Lys Thr Leu Val Ala Ser Val Thr Glu Gly Leu Ala Leu 
195 200 205 

Leu His Asp Val Leu Asp Glu Arg Arg Arg Asn Pro Leu Glu Asn Asp 
210 215 220 

Val Leu Thr Met Leu Leu Gin Ala Glu Ala Asp Gly Ser Arg Leu Ser 
225 230 235 240 

Thr Lys Glu Leu Val Ala Leu Val Gly Ala He He Ala Ala Gly Thr 
245 250 255 

Asp Thr Thr He Tyr Leu He Ala Phe Ala Val Leu Asn Leu Leu Arg 



2435 



260 



265 



4-30582A 




72 



Ser Pro Glu Ala Leu Glu Leu Val Lys Ala Glu Pro Gly Leu Met Arg 
275 280 285 

Asn Ala Leu Asp Glu Val Leu Arg Phe Asp Asn lie Leu Arg lie Gly 
290 295 300 

Thr Val Arg Phe Ala Arg Gin Asp Leu Glu Tyr Cys Gly Ala Ser lie 
305 310 315 320 

Lys Lys Gly Glu Met Val Phe Leu Leu lie Pro Ser Ala Leu Arg Asp 
325 330 335 

Gly Thr Val Phe Ser Arg Pro Asp Val Phe Asp Val Arg Arg Asp Thr 
340 345 350 

Gly Ala Ser Leu Ala Tyr Gly Arg Gly Pro His Val Cys Pro Gly Val 
355 360 365 

Ser Leu Ala Arg Leu Glu Ala Glu He Ala Val Gly Thr He Phe Arg 
370 ~ 375 380 

Arg Phe Pro Glu Met Lys Leu Lys Glu Thr Pro Val Phe Gly Tyr His 
385 390 395 400 

Pro Ala Phe Arg Asn He Glu Ser Leu Asn Val lie Leu Lys Pro Ser 
405 410 415 

Lys Ala Gly 



<210> 9 
<211> 607 
<212> PRT 

<213> Sorangium cellulosum 
<400> 9 

Ala Ser Leu Asp Ala Leu Phe Ala Arg Ala Thr Ser Ala Arg Val Leu 
15 10 15 

Asp Asp Gly His Gly Arg Ala Thr Glu Arg His Val Leu Ala Glu Ala 
20 25 30 

Arg Gly He Glu Asp Leu Arg Ala Leu Arg Glu His Leu Arg He Gin 
35 40 45 

Glu Gly Gly Pro Ser Phe His Cys Met Cys Leu Gly Asp Leu Thr Val 
50 55 60 

Glu Leu Leu Ala His Asp Gin Pro Leu Ala Ser He Ser Phe His His 
65 70 75 80 

Ala Arg Ser Leu Arg His Pro Asp Trp Thr Ser Asp Ala Met Leu Val 
85 90 95 

Asp Gly Pro Ala Leu Val Arg Trp Leu Ala Ala Arg Gly Ala Pro Gly 
100 105 110 

Pro Leu Arg Glu Tyr Glu Glu Glu Arg Glu Arg Ala Arg Thr Ala Gin 
115 ~ 120 125 

Glu Ala Arg Arg Leu Trp Leu Ala Ala Ala Pro Pro Cys Phe Ala Pro 
130 135 140 

Asp Leu Pro Arg Phe Glu Asp Asp Ala Asn Gly Leu Pro Leu Gly Pro 
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150 155 160 



Met Ser Pro Glu Val Ala Glu Ala Glu Arg Arg Leu Arg Ala Ser Tyr 
165 170 175 

Ala Thr Pro Glu Leu Ala Cys Ala Ala Leu Leu Ala Trp Leu Gly Thr 
180 185 190 

Glv Ala Gly Pro Trp Ser Gly Tyr Pro Ala Tyr Glu Met Leu Pro Glu 
195 " 200 205 

Asn Leu Leu Leu Gly Phe Gly Leu Pro Thr Ala lie Ala Ala Ala Ser 
210 215 220 

Ala Pro Gly Thr Ser Glu Ala Ala Leu Arg Gly Ala Ala Arg Leu Phe 
225 230 235 240 

Ala Ser Trp Glu Val Val Ser Ser Lys Lys Ser Gin Leu Gly Asn He 
245 250 255 

Pro Glu Ala Leu Trp Glu Arg Leu Arg Thr He Val Arg Ala Met Gly 
260 265 270 

Asn Ala Asp Asn Leu Ser Arg Phe Glu Arg Ala Glu Ala He Ala Ala 
275 280 285 

Glu Val Arg Arg Leu Arg Ala Gin Pro Ala Pro Phe Ala Ala Gly Ala 
290 ~ 295 300 

Gly Leu Ala Val Ala Gly Val Ser Ser Ser Gly Arg Leu Ser Gly Leu 
305 310 315 320 

Val Thr Asp Gly Asp Ala Leu Tyr Ser Gly Asp Gly Asn Asp He Val 
325 330 335 

Met Phe Gin Pro Gly Arg He Ser Pro Val Val Leu Leu Ala Gly Thr 
340 " 345 350 

Asp Pro Phe Phe Glu Leu Ala Pro Pro Leu Ser Gin Met Leu Phe Val 
355 360 365 

Ala His Ala Asn Ala Gly Thr He Ser Lys Val Leu Thr Glu Gly Ser 
370 375 380 

Pro Leu He Val Met Ala Arg Asn Gin Ala Arg Pro Met Ser Leu Val 
385 390 395 400 

His Ala Arg Gly Phe Met Ala Trp Val Asn Gin Ala Met Val Pro Asp 
405 410 415 

Pro Glu Arg Gly Ala Pro Phe Val Val Gin Arg Ser Thr He Met Glu 
420 425 430 

Phe Glu His Pro Thr Pro Arg Cys Leu His Glu Pro Ala Gly Ser Ala 
435 440 445 

Phe Ser Leu Ala Cys Asp Glu Glu His Leu Tyr Trp Cys Glu Leu Ser 
450 455 460 

Ala Gly Arg Leu Glu Leu Trp Arg His Pro His His Arg Pro Gly Ala 
465 470 475 480 

Pro Ser Arg Phe Ala Tyr Leu Gly Glu His Pro He Ala Ala Thr Trp 
485 490 495 

Tyr Pro Ser Leu Thr Leu Asn Ala Thr His Val Leu Trp Ala Asp Pro 
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500 505 510 

Asp Arg Arg Ala lie Leu Gly Val Asp Lys Arg Thr Gly Val Glu Pro 
515 520 525 

He Val Leu Ala Glu Thr Arg His Pro Pro Ala His Val Val Ser Glu 
530 535 540 

Asp Arg Asp He Phe Ala Leu Thr Gly Gin Pro Asp Ser Arg Asp Trp 
545 550 555 560 

His Val Glu His He Arg Ser Gly Ala Ser Thr Val Val Ala Asp Tyr 
565 ~ 570 575 

Gin Arg Gin Leu Trp Asp Arg Pro Asp Met Val Leu Asn Arg Arg Gly 
580 "* 585 590 

Leu Phe Phe Thr Thr Asn Asp Arg He Leu Thr Leu Ala Arg Ser 
595 600 605 



<210> 10 
<211> 423 
<212> PRT 

<213> Sorangium cellulosum 

<400> 10 , ^ 

Met Gly Ala Leu He Ser Val Ala Ala Pro Gly Cys Ala Leu Gly Gly 
X 5 10 15 

Ala Glu Glu Glu Gly Gin Pro Gly Gin Asp Ala Gly Ala Gly Ala Leu 
20 25 30 

Ala Pro Ala Arg Glu Val Met Ala Ala Glu Val Ala Ala Gly Gin Met 
35 40 45 

Pro Gly Ala Val Trp Leu Val Ala Arg Gly Asp Asp Val His Val Asp 
50 55 60 

Ala Val Gly Val Thr Glu Leu Gly Gly Ser Ala Pro Met Arg Arg Asp 
65 70 75 80 

Thr He Phe Arg He Ala Ser Met Thr Lys Ala Val Thr Ala Thr Ala 
85 90 95 

Val Met Met Leu Val Glu Glu Gly Lys Leu Asp Leu Asp Ser Pro Val 
100 105 . HO 

Asp Arg Trp Leu Pro Glu Leu Ala Asn Arg Lys Val Leu Ala Arg He 
115 120 125 

Asp Gly Pro He Asp Glu Thr Val Pro Ala Glu Arg Pro He Thr Val 
130 135 140 

Arg Asp Leu Met Thr Phe Thr Met Gly Phe Gly He Ser Phe Asp Ala 
145 150 155 160 

Ser Ser Pro He Gin Arg Ala He Asp Glu Leu Gly Leu Val Asn Ala 
165 170 175 

Gin Pro Val Pro Met Thr Pro His Gly Pro Asp Glu Trp He Arg Arg 
180 185 190 

Leu Gly Thr Leu Pro Leu Met His Gin Pro Gly Ala Gin Trp Met Tyr 
195 200 205 
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Asn Thr Gly Ser Leu Val Gin Gly Val Leu Val Gly Arg Ala Ala Asp 
210 . 215 220 

Gin Gly Phe Asp Ala Phe Val Arg Glu Arg He Leu Ala Pro Leu Gly 
225 230 235 240 

Met Arg Asp Thr Asp Phe His Val Pro Ala Asp Lys Leu Ala Arg Phe 
245 250 255 

Ala Gly Cys Gly Tyr Phe Thr Asp Glu Gin Thr Gly Glu Lys Thr Arg 
260 265 270 

Met Asp Arg Asp Gly Ala Glu Ser Ala Tyr Ala Ser Pro Pro Ala Phe 
275 280 285 

Pro Ser Gly Ala Ala Gly Leu Val Ser Thr Val Asp Asp Tyr Leu Leu 
290 295 300 

Phe Ala Arg Met Leu Met Asn Gly Gly Val His Glu Gly Arg Arg Leu 
305 " 310 315 320 

Leu Ser Ala Ala Ser Val Arg Glu Met Thr Ala Asp His Leu Thr Pro 
325 330 335 

Ala Gin Lys Ala Ala Ser Ser Phe Phe Pro Gly Phe Phe Glu Thr His 
340 345 350 

Glv Trp Gly Tyr Gly Met Ala Val Val Thr Ala Pro Asp Ala Val Ser 
355 "* 360 365 

Glu Val Pro Gly Arg Tyr Gly Trp Asp Gly Gly Phe Gly Thr Ser Trp 
370 375 380 

He Asn Asp Pro Gly Arg Glu Leu He Gly lie Val Met Thr Gin Ser 
385 390 395 400 

Ala Gly Phe Leu Phe Ser Gly Ala Leu Glu Arg Phe Trp Arg Ser Val 
405 410 415 

Tyr Val Ala Thr Glu Ser Ala 
420 



<210> 11 
<211> 713 
<212> PRT 

<213> Sorangium cellulosum 

Met^His^Gly Leu Thr Glu Arg Gin Val Leu Leu Ser Leu Val Thr Leu 
1 5 10 15 

Ala Leu He Leu Val Thr Ala Arg Ala Ser Gly Glu Leu Ala Arg Arg 
20 25 30 

Leu Arg Gin Pro Glu Val Leu Gly Glu Leu Phe Gly Gly Val Val Leu 
35 40 45 

Gly Pro Ser Val Val Gly Ala Leu Ala Pro Gly Phe His Arg Ala Leu 
50 55 60 

Phe Gin Glu Pro Ala Val Gly Val Val Leu Ser Gly He Ser Trp lie 
65 70 75 80 

Gly Ala Leu Leu Leu Leu Leu Met Ala Gly He Glu Val Asp Val Gly 
85 90 95 
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He Leu Arg Lys Glu Ala Arg Pro Gly Ala Leu Ser Ala Leu Gly Ala 
100 * 105 110 

He Ala Pro Pro Leu Ala Ala Gly Ala Ala Phe Ser Ala Leu Val Leu 
115 120 125 

Asp Arg Pro Leu Pro Ser Gly Leu Phe Leu Gly He Val Leu Ser Val 
130 135 140 

Thr Ala Val Ser Val He Ala Lys Val Leu He Glu Arg Glu Ser Met 
145 150 155 160 

Arg Arg Ser Tyr Ala Gin Val Thr Leu Ala Ala Gly Val Val Ser Glu 
165 170 175 

Val Ala Ala Trp Val Leu Val Ala Met Thr Ser Ser Ser Tyr Gly Ala 
180 185 190 

Ser Pro Ala Leu Ala Val Ala Arg Ser Ala Leu Leu Ala Ser Gly Phe 
195 200 205 

Leu Leu Phe Met Val Leu Val Gly Arg Arg Leu Thr His Leu Ala Met 
210 215 220 

Arg Trp Val Ala Asp Ala Thr Arg Val Ser Lys Gly Gin Val Ser Leu 
225 230 235 240 

Val Leu Val Leu Thr Phe Leu Ala Ala Ala Leu Thr Gin Arg Leu Gly 
245 250 255 

Leu His Pro Leu Leu Gly Ala Phe Ala Leu Gly Val Leu Leu Asn Ser 
260 265 270 

Ala Pro Arg Thr Asn Arg Pro Leu Leu Asp Gly Val Gin Thr Leu Val 
275 280 285 

Ala Gly Leu Phe Ala Pro Val Phe Phe Val Leu Ala Gly Met Arg Val 
290 295 300 

Asp Val Ser Gin Leu Arg Thr Pro Ala Ala Trp Gly Thr Val Ala Leu 
305 310 315 320 

Leu Leu Ala Thr Ala Thr Ala Ala Lys Val Val Pro Ala Ala Leu Gly 
325 330 335 

Ala Arg Leu Gly Gly Leu Arg Gly Ser Glu Ala Ala Leu Val Ala Val 
340 345 350 

Gly Leu Asn Met Lys Gly Gly Thr Asp Leu He Val Ala He Val Gly 
355 360 365 

Val Glu Leu Gly Leu Leu Ser Asn Glu Ala Tyr Thr Met Tyr Ala Val 
370 375 380 

Val Ala Leu Val Thr Val Thr Ala Ser Pro Ala Leu Leu He Trp Leu 
385 390 395 400 

Glu Lys Arg Ala Pro Pro Thr Gin Glu Glu Ser Ala Arg Leu Glu Arg 
405 410 415 

Glu Glu Ala Ala Arg Arg Ala Tyr He Pro Gly Val Glu Arg He Leu 
420 425 430 

Val Pro He Val Ala His Ala Leu Pro Gly Phe Ala Thr Asp He Val 
435 440 445 
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Glu Ser lie Val Ala Ser Lys Arg Lys Leu Gly Glu Thr Val Asp lie 
450 455 460 

Thr Glu Leu Ser Val Glu Gin Gin Ala Pro Gly Pro Ser Arg Ala Ala 
465 470 475 480 

Glv Glu Ala Ser Arg Gly Leu Ala Arg Leu Gly Ala Arg Leu Arg Val 
485 490 495 

Gly He Trp Arg Gin Arg Arg Glu Leu Arg Gly Ser He Gin Ala He 
500 ~ 505 510 

Leu Arg Ala Ser Arg Asp His Asp Leu Leu Val He Gly Ala Arg Ser 
515 ^ 520 525 

Pro Ala Arg Ala Arg Gly Met Ser Phe Gly Arg Leu Gin Asp Ala He 
530 535 540 

Val Gin Arg Ala Glu Ser Asn Val Leu Val Val Val Gly Asp Pro Pro 
545 550 555 560 

Ala Ala Glu Arg Ala Ser Ala Arg Arg He Leu Val Pro He He Gly 
565 570 575 

Leu Glu Tyr Ser Phe Ala Ala Ala Asp Leu Ala Ala His Val Ala Leu 
580 585 590 

Ala Trp Asp Ala Glu Leu Val Leu Leu Ser Ser Ala Gin Thr Asp Pro 
595 600 605 

Gly Ala Val Val Trp Arg Asp Arg Glu Pro Ser Arg Val Arg Ala Val 
610 615 620 

Ala Arg Ser Val Val Asp Glu Ala Val Phe Arg Gly Arg Arg Leu Gly 
625 " 630 635 640 

Val Arg Val Ser Ser Arg Val His Val Gly Ala His Pro Ser Asp Glu 
645 650 655 

He Thr Arg Glu Leu Ala Arg Ala Pro Tyr Asp Leu Leu Val Leu Gly 
660 665 670 

Cys Tyr Asp His Gly Pro Leu Gly Arg Leu Tyr Leu Gly Ser Thr Val 
675 " 680 685 

Glu Ser Val Val Val Arg Ser Arg Val Pro Val Ala Leu Leu Val Ala 
690 695 700 

His Gly Gly Thr Arg Glu Gin Val Arg 
705 710 

<210> 12 
<211> 126 
<212> PRT 

<213> Sorangium cellulosum 
<400> 12 

Met Asp Lys Pro He Gly Arg Thr Arg Cys Ala lie Ala Glu Gly Tyr 
15 10 15 

He Pro Gly Gly Ser Asn Gly Pro Glu Pro Gin Met Thr Ser His Glu 
20 25 30 

Thr Ala Cys Leu Leu Asn Ala Ser Asp Arg Asp Ala Gin Val Ala He 
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35 40 45 

Thr Val Tyr Phe Ser Asp Arg Asp Pro Ala Gly Pro Tyr Arg Val Thr 
50 55 60 ! 

Val Pro Ala Arg Arg Thr Arg His Val Arg Phe Asn Asp Leu Thr Glu 
65 " 70 75 80 

Pro Glu Pro lie Pro Arg Asp Thr Asp Tyr Ala Ser Val lie Glu Ser 
85 90 95 

Asp Ala Pro lie Val Val Gin His Thr Arg Leu Asp Ser Arg Gin Ala 
100 105 110 

Glu Asn Ala Leu Leu Ser Thr He Ala Tyr Thr Asp Arg Glu 
115 120 125 



<210> 13 
<211> 149 
<212> PRT 

<213> Sorangium cellulosum 
<400> 13 

Met Lys His Val Asp Thr Gly Arg Arg Phe Gly Arg Arg He Gly His 
1 5 10 15 

Thr Leu Gly Leu Leu Ala Ser Met Ala Leu Ala Gly Cys Gly Gly Pro 
20 25 30 

Ser Glu Lys Thr Val Gin Gly Thr Arg Leu Ala Pro Gly Ala Asp Ala 
35 40 45 

Arg Val Thr Ala Asp Val Asp Pro Asp Ala Ala Thr Thr Arg Leu Ala 
50 55 60 

Val Asp Val Val His Leu Ser Pro Pro Glu Arg Leu Glu Ala Gly Ser 
65 70 75 80 

Glu Arg Phe Val Val Trp Gin Arg Pro Ser Pro Glu Ser Pro Trp Arg 
85 90 95 

Arg Val Gly Val Leu Asp Tyr Asn Ala Asp Ser Arg Arg Gly Lys Leu 
100 105 110 

Ala Glu Thr Thr Val Pro Tyr Ala Asn Phe Glu Leu Leu He Thr Ala 
115 120 125 

Glu Lys Gin Ser Ser Pro Gin Ser Pro Ser Ser Ala Ala Val He Gly 
130 135 140 

Pro Thr Ser Val Gly 
145 



<210> 14 
<211> 184 
<212> PRT 

<213> Sorangium cellulosum 

<400> 14 , M „ 

Val Thr Ser Glu Glu Val Pro Gly Ala Ala Leu Gly Ala Gin Ser Ser 
1 5 10 15 



Leu Val Arg Ala Gin His Ala Ala Arg His Val Arg Pro Cys Thr Arg 
20 25 30 
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Ala Glu Glu Pro Pro Ala Leu Met His Gly Leu Thr Glu Arg Gin Val 
35 40 45 

Leu Leu Ser Leu Val Ala Leu Ala Leu Val Leu Leu Thr Ala Arg Ala 
50 55 60 

Phe Gly Glu Leu Ala Arg Arg Leu Arg Gin Pro Glu Val Leu Gly Glu 
65 70 75 80 

Leu Phe Gly Gly Val Val Leu Gly Pro Ser Val Val Gly Ala Leu Ala 
85 90 95 

Pro Gly Phe His Arg Val Leu Phe Gin Asp Pro Ala Val Gly Val Val 
100 105 HO 

Leu Ser Gly He Ser Trp He Gly Ala Leu Val Leu Leu Leu Met Ala 
115 120 125 

Gly He Glu Val Asp Val Ser He Leu Arg Lys Glu Ala Arg Pro Gly 
130 135 140 

Ala Leu Ser Ala Leu Gly Ala He Ala Pro Pro Leu Arg Thr Pro Gly 
145 150 155 160 

Pro Leu Val Gin Arg Met Gin Gly Ala Phe Thr Trp Asp Leu Asp Val 
165 170 175 

Ser Pro Arg Arg Ser Ala Gin Ala 



<210> 15 
<211> 145 
<212> PRT 

<213> Sorangium cellulosum 



Val Asn Ala Pro Cys Met Arg Cys Thr Ser Gly Pro Gly Val Arg Ser 
15 10 15 

Glv Gly Ala He Ala Pro Ser Ala Glu Ser Ala Pro Gly Arg Ala Ser 
20 25 30 

Leu Arq Arg Met Leu Thr Ser Thr Ser He Pro Ala Met Ser Ser Arg 
35 40 45 

Thr Ser Ala Pro He Gin Glu Met Pro Glu Ser Thr Thr Pro Thr Ala 
50 55 60 

Gly Ser Trp Lys Arg Thr Arg Trp Asn Pro Gly Ala Ser Ala Pro Thr 
65 ^ 70 75 80 

Thr Asd Gly Pro Ser Thr Thr Pro Pro Lys Ser Ser Pro Ser Thr Ser 
• 85 90 95 

Gly Trp Arg Ser Arg Arg Ala Ser Ser Pro Lys Ala Arg Ala Val Arg 
100 105 HO 

Arg Thr Ser Ala Arg Ala Thr Ser Glu Ser Arg Thr Cys Arg Ser Val 
115 *" 120 125 

Arg Pro Cys He Arg Ala Gly Gly Ser Ser Ala Arg Val Gin Gly Arg 



180 



130 



135 



Thr 
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145 

<210> 16 
<211> 185 
<212> PRT 

<213> Sorangium cellulosum 
<400> 16 

Val Leu Ala Pro Pro Ala Asp He Arg Pro Pro Ala Ala Ala Gin Leu 
! 5 10 15 

Glu Pro Asp Ser Pro Asp Asp Glu Ala Asp Glu Ala Asp Glu Ala Leu 
20 25 30 

Arg Pro Phe Arg Asp Ala He Ala Ala Tyr Ser Glu Ala Val Arg Trp 
35 40 45 

Ala Glu Ala Ala Gin Arg Pro Arg Leu Glu Ser Leu Val Arg Leu Ala 
50 55 60 

He Val Arg Leu Gly Lys Ala Leu Asp Lys Val Pro Phe Ala His Thr 
65 70 75 80 

Thr Ala Gly Val Ser Gin He Ala Gly Arg Leu Gin Asn Asp Ala Val 
85 90 95 

Trp Phe Asp Val Ala Ala Arg Tyr Ala Ser Phe Arg Ala Ala Thr Glu 
100 105 HO 

His Ala Leu Arg Asp Ala Ala Ser Ala Met Glu Ala Leu Ala Ala Gly 
115 120 125 

Pro Tyr Arg Gly Ser Ser Arg Val Ser Ala Ala Val Gly Glu Phe Arg 
130 ~ 135 140 

Gly Glu Ala Ala Arg Leu His Pro Ala Asp Arg Val Pro Ala Ser Asp 
145 150 155 160 

Gin Gin He Leu Thr Ala Leu Arg Ala Ala Glu Arg Ala Leu He Ala 
165 170 175 

Leu Tyr Thr Ala Phe Ala Arg Glu Glu 
180 185 

<210> 17 
<211> 146 
<212> PRT 

<213> Sorangium cellulosum 

<400> 17 , - , » r 

Met Ala Asp Ala Ala Ser Arg Ser Ala Cys Ser Val Ala Ala Arg Lys 
1 5 10 15 

Leu Ala Tyr Arg Ala Ala Thr Ser Asn Gin Thr Ala Ser Phe Trp Ser 
20 25 30 

Leu Pro Ala He Trp Glu Thr Pro Ala Val Val Cys Ala Lys Gly Thr 
35 40 45 

Leu Ser Ser Ala Leu Pro Ser Arg Thr He Ala Ser Arg Thr Arcj Leu 
50 55 60 

Ser Ser Arg Gly Arg Cys Ala Ala Ser Ala His Arg Thr Ala Ser Glu 
65 70 75 80 
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Tyr Ala Ala lie Ala Ser Arg Asn Gly Arg Ser Ala Ser Ser Ala Ser 
85 90 95 

Ser Ala Ser Ser Ser Gly Glu Ser Gly Ser Ser Trp Ala Ala Ala Gly 
100 105 110 

Gly Arg Met Ser Ala Gly Gly Ala Ser Thr Gly Glu Val Tyr Glu Gin 
115 120 125 

Ala Pro Arg Leu Arg Leu Ala Gin Ser Val Ala Ala Arg Arg Arg Asp 
130 " 135 140 

Pro Thr 
145 

<210> 18 
<211> 288 
<212> PRT 

<213> Sorangium cellulosum 

Vai°Thr 8 Val Ser Ser Met Pro Arg Ser Trp Ser Ser Arg Val Arg Thr 
1 5 10 I 5 

Val Val Thr Ala Leu Gly Cys Ala Arg Arg Leu Ser Gly Ser lie Ser 
20 25 30 

Arg Leu Arg Arg His Pro Glu Ala Gly Arg Ala Pro Arg Ser Arg Leu 
35 40 45 

Arg Ala Trp Arg Arg Leu Pro Gin His lie Ser Ser Pro Trp Arg His 
50 55 60 

Leu Pro Pro Gly Ala Arg Val Gly Thr Ser Cys Pro Ala Asp Arg Arg 
65 70 75 80 

He Leu Pro Ser His Arg Thr Ala Asp Leu Gly Thr Ser Gly Gly Thr 
85 90 95 

Leu Val Ala Arg Met Ser Gly His Val Ala Arg Asn Pro His Ala Ala 
100 105 11° 

Val Leu Val Gly Asp Gly Ser Ala Arg Gly Arg Arg Arg Leu Ser Asn 
115 120 125 

Arg Arg Ala Glu Arg Arg Val Ser Asp Val Thr Cys Arg Glu Gly Gly 
130 135 140 

Glu Ala Met Gin Lys He Ala Gly Lys Leu Val Val Gly Leu He Ser 
145 150 155 160 

Val Ser Gly Met Ser Leu Leu Ala Ala Cys Gly Gly Glu Lys Arg Ser 
165 170 175 

Gly Gly Glu Ala Gin Thr Pro Gly Gly Ala Gin Gly Glu Ala Pro Val 
180 185 I 90 

Pro Val Gly Ser Ala Val Asp Ser He Val Ala Ala Arg Cys Asp Arg 
195 200 205 

Glu Ala Arg Cys Asn Asn He Gly Gin Asp Arg Glu Tyr Ser Ser Lys 
210 215 220 

Asp Ala Cys Ser Asn Lys He Arg Ser Glu Trp Arg Asp Glu Leu Thr 
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230 235 240 



Phe Gly Glu Cys Pro Gly Gly lie Asp Ala Lys Gin Leu Asn Glu Cys 
245 250 255 

Leu Glu Gly lie Arg Asn Glu Gly Cys Gly Asn Pro Phe Asp Thr Leu 
260 265 270 

Gly Arg Val Val Ala Cys Arg Ser Ser Asp Leu Cys Arg Asp Ala Arg 
275 280 285 



<210> 19 
<211> 288 
<212> PRT 

<213> Sorangium cellulosum 

<400> 19 . 
Val Thr Val Ser Ser Met Pro Arg Ser Trp Ser Ser Arg Val Arg Thr 
x 5 10 15 

Val Val Thr Ala Leu Gly Cys Ala Arg Arg Leu Ser Gly Ser He Ser 
£ 20 25 30 

Arg Leu Arg Arg His Pro Glu Ala Gly Arg Ala Pro Arg Ser Arg Leu 

!>* 35 40 45 

■ ^ Arg Ala Trp Arg Arg Leu Pro Gin His He Ser Ser Pro Trp Arg His 
50 55 60 



Leu Pro Pro Gly Ala Arg Val Gly Thr Ser Cys Pro Ala Asp Arg Arg 
65 70 75 80 



He Leu Pro Ser His Arg Thr Ala Asp Leu Gly Thr Ser Gly Gly Thr 
J 85 90 95 



Leu Val Ala Arg Met Ser Gly His Val Ala Arg Asn Pro His Ala Ala 
100 105 HO 

Val Leu Val Gly Asp Gly Ser Ala Arg Gly Arg Arg Arg Leu Ser Asn 
115 120 125 

Ara Arg Ala Glu Arg Arg Val Ser Asp Val Thr Cys Arg Glu Gly Gly « 
130 135 140 

Glu Ala Met Gin Lys He Ala Gly Lys Leu Val Val Gly Leu He Ser 
145 150 155 160 

Val Ser Gly Met Ser Leu Leu Ala Ala Cys Gly Gly Glu Lys Arg Ser 
165 170 175 

Glv Gly Glu Ala Gin Thr Pro Gly Gly Ala Gin Gly Glu Ala Pro Val 
180 185 190 

Pro Val Gly Ser Ala Val Asp Ser He Val Ala Ala Arg Cys Asp Arg 
195 200 205 

Glu Ala Arg Cys Asn Asn He Gly Gin Asp Arg Glu Tyr Ser Ser Lys 
210 "* 215 220 

Asp Ala Cys Ser Asn Lys He Arg Ser Glu Trp Arg Asp Glu Leu Thr 
225 230 235 240 
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Phe Gly Glu Cys Pro Gly Gly lie Asp Ala Lys Gin Leu Asn Glu Cys 
245 250 255 

Leu Glu Gly He Arg Asn Glu Gly Cys Gly Asn Pro Phe Asp Thr Leu 
260 265 270 

Glv Arg Val Val Ala Cys Arg Ser Ser Asp Leu Cys Arg Asp Ala Arg 
275 280 285 



<210> 20 
<211> 155 
<212> PRT 

<213> Sorangium cellulosum 

Met°Asp°Pro Arg Ala Arg Arg Glu Lys Arg Pro Ser Leu Leu Asp Ser 
15 10 15 

Arg Gly Arg Gin Pro Lys Arg Ser Gin Gin Gly Gly His Met Glu Lys 
20 25 30 

Pro He Gly Arg Thr Arg Trp Ala He Ala Glu Gly Tyr He Pro Gly 
35 40 45 

Arg Ser Asn Gly Pro Glu Pro Gin Met Thr Ser His Glu Thr Ala Cys 
50 55 60 

Leu Leu Asn Ala Ser Asp Arg Asp Ala Gin Val Ala He Thr Val Tyr 
65 70 75 80 

Phe Ser Asp Arg Asp Pro Ala Gly Pro Tyr Arg Val Thr Val Pro Ala 
85 9° 95 

Arg Arg Thr Arg His Val Arg Phe Asn Asp Leu Thr Glu Pro Glu Pro 
100 1° 5 110 

He Pro Arg Asp Thr Asp Tyr Ala Ser Val He Glu Ser Asp Val Pro 
115 120 125 

He Val Val Gin His Thr Arg Leu Asp Ser Arg Gin Ala Glu Asn Ala 
130 135 140 

Leu He Ser Thr He Ala Tyr Thr Asp Arg Glu 
145 150 155 

<210> 21 
<211> 156 
<212> PRT 

<213> Sorangium cellulosum 

Vai°Arg 1 Arg Ser Arg Trp Gin Met Lys His Val Asp Thr Gly Arg Arg 
1 5 1° 15 

Val Gly Arg Arg He Gly Leu Thr Leu Gly Leu Leu Ala Ser Met Ala 
20 25 30 

Leu Ala Gly Cys Gly Gly Pro Ser Glu Lys He Val Gin Gly Thr Arg 
35 40 45 

Leu Ala Pro Gly Ala Asp Ala His Val Ala Ala Asp Val Asp Pro Asp 
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50 55 



60 



Ala 
65 


Ala 


Thr 


Thr 


Arg 


Leu 
70 


Ala 


Val Asp 


Val 


Val 
75 


His 


Leu 


Ser 


Pro 


Pro 
fin 

O \J 


Glu 


Arg 


lie 


Glu 


Ala 
85 


Gly 


Ser 


Glu Arg 


Phe 
90 


Val 


Val 


Trp 


Gin Arg 
95 


Pro 


Ser 


Ser 


Glu 


Ser 
100 


Pro 


Trp 


Gin Arg Val 
105 


Gly Val 


Leu 


Asp 


Tyr 
110 


Asn 


Ala 


Ala 


Ser Arg Arg Gly 

lis 


Lys 


Leu 


Ala Glu 
120 


Thr 


Thr 


Val 


Pro 
125 


His 


Ala 


Asn 


Phe 


Glu 
130 


Leu 


Leu 


lie 


Thr 


Val 
135 


Glu Lys 


Gin 


Ser 


Ser 
140 


Pro 


Gin 


Ser 


Pro 


Ser 
145 


Ser 


Ala 


Ala 


Val 


He 
150 


Gly 


Pro Thr 


Ser 


Val 
155 


Gly 











<210> 22 
<211> 305 
<212> PRT 

<213> Sorangium cellulosum 

Met°Glu 2 Lys Glu Ser Arg He Ala He Tyr Gly Ala He Ala Ala Asn 
x u 5 10 15 

Val Ala He Ala Ala Val Lys Phe He Ala Ala Ala Val Thr Gly Ser 
20 25 30 

Ser Ala Met Leu Ser Glu Gly Val His Ser Leu Val Asp Thr Ala Asp 
35 40 45 

Gly Leu Leu Leu Leu Leu Gly Lys His Arg Ser Ala Arg Pro Pro Asp 
50 55 60 

Ala Glu His Pro Phe Gly His Gly Lys Glu Leu Tyr Phe Trp Thr Leu 
65 70 75 80 

He Val Ala He Met He Phe Ala Ala Gly Gly Gly Val Ser lie Tyr 
85 90 95 

Glu Gly He Leu His Leu Leu His Pro Arg Gin He Glu Asp Pro Thr 
100 105 HO 

Trp Asn Tyr Val Val Leu Gly Ala Ala Ala Val Phe Glu Gly Thr Ser 
115 120 125 

Leu He He Ser He His Glu Phe Lys Lys Lys Asp Gly Gin Gly Tyr 
130 135 140 

Leu Ala Ala Met Arg Ser Ser Lys Asp Pro Thr Thr Phe Thr He Val 
145 150 155 160 

Leu Glu Asp Ser Ala Ala Leu Ala Gly Leu Thr He Ala Phe Leu Gly 
165 170 175 

Val Trp Leu Gly His Arg Leu Gly Asn Pro Tyr Leu Asp Gly Ala Ala 
180 185 190 

Ser He Gly He Gly Leu Val Leu Ala Ala Val Ala Val Phe Leu Ala 
195 200 205 
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Ser 


Gin 
210 


Ser Arg Gly Leu 


Leu Val Gly Glu 
215 


Ser 


Ala 
220 


Asp Arg 


Glu Leu 


Leu 
225 


Ala 


Ala 


He Arg 


Ala 
230 


Leu 


Ala 


Ser Ala 


Asp 
235 


Pro 


Gly Val 


Ser Ala 
240 


Val 


Gly 


Arg 


Pro Leu 
245 


Thr 


Met 


His 


Phe Gly 
250 


Pro 


His 


Glu Val 


Leu Val 
255 


Val 


Leu 


Arg 


He Glu 
260 


Phe 


Asp 


Ala 


Ala Leu 
265 


Thr 


Ala 


Ser Gly Val Ala 
270 


Glu 


Ala 


He 
275 


Glu Arg 


He 


Glu 


Thr 
280 


Arg He 


Arg 


Ser 


Glu Arg 
285 


Pro Asp 


Val 


Lys 
290 


His 


He Tyr 


Val 


Glu 
295 


Ala 


Arg Ser 


Leu 


His 
300 


Gin Arg 


Ala Arg 



Ala 
305 

<210> 23 

<211> 135 

<212> PRT 

<213> Sorangium cellulosum 

Vai°Gln 3 Thr Ser Ser Phe Asp Ala Arg Tyr Ala Gly Cys Lys Ser Ser 

1 5 10 15 

Arg Arg He Ala Arg Ser Gly Ser Ala Gly Ala Arg Ala Gly Arg Ala 
20 25 30 

His Glu Gly Ala Ala Ser Ala Gly Phe Glu Gly Gly Asp Val Met Arg 
35 40 45 

Lys Ala Arg Ala His Gly Ala Met Leu Gly Gly Arg Asp Asp Gly Trp 
50 55 60 

Arg Arg Gly Leu Pro Gly Ala Gly Ala Leu Arg Ala Ala Leu Gin Arg 
65 70 75 ttU 

Gly Arg Ser Arg Asp Leu Ala Arg Arg Arg Leu He Ala Ser Val Ser 
85 .90 95 

Leu Ala Gly Gly Ala Ser Met Ala Val Val Ser Leu Phe Gin Leu Gly 
100 105 110 

He He Glu Arg Leu Pro Asp Pro Pro Leu Pro Gly Phe Asp Ser Ala 
115 120 125 

Lys Val Thr Ser Ser Asp He 
130 135 



<210> 24 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> . , 

<223> Description of Artificial Sequence: universal 

reverse primer 



<400> 24 



4-30582A 



86 



ggaaacagct atgaccatg 

<210> 25 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<220> . . 

<223> Description of Artificial Sequence: universal 

forward primer 
<400> 25 

gtaaaacgac ggccagt 

<210> 26 
<211> 28 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: PCR primer 

NH24 end "B" 
<400> 26 

gtgactggcg cctggaatct gcatgagc 

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer NH2 
end "A" 

<400> 27 

agcgggagct tgctagacat tctgtttc 

<210> 28 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer NH2 
end "B" 

<400> 28 

gacgcgcctc gggcagcgcc ccaa 

<210> 29 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
pEPO!5-NH6 end "B" 

<400> 29 

caccgaagcg tcgatctggt ccatc 

<210> 30 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



4-30582A 



-87 



<223> Description of Artificial Sequence: PCR primer 
pEP015H2.7 end "A" 

<400> 30 

cggtcagatc gacgacgggc tttcc 



